1
|
Goecker ZC, Burke MC, Remoroza CA, Liu Y, Mirokhin YA, Sheetlin SL, Tchekhovskoi DV, Yang X, Stein SE. Variation of Site-Specific Glycosylation Profiles of Recombinant Influenza Glycoproteins. Mol Cell Proteomics 2024; 23:100827. [PMID: 39128790 PMCID: PMC11417209 DOI: 10.1016/j.mcpro.2024.100827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 07/08/2024] [Accepted: 08/07/2024] [Indexed: 08/13/2024] Open
Abstract
This work presents a detailed determination of site-specific N-glycan distributions of the recombinant influenza glycoproteins hemagglutinin (HA) and neuraminidase. Variation in glycosylation among recombinant glycoproteins is not predictable and can depend on details of the biomanufacturing process as well as details of protein structure. In this study, recombinant influenza proteins were analyzed from eight strains of four different suppliers. These include five HA and three neuraminidase proteins, each produced from a HEK293 cell line. Digestion was conducted using a series of complex multienzymatic methods designed to isolate glycopeptides containing single N-glycosylated sites. Site-specific glycosylation profiles of intact glycopeptides were produced using a recently developed method and comparisons were made using spectral similarity scores. Variation in glycan abundances and distribution was most pronounced between different strains of virus (similarity score = 383 out of 999), whereas digestion replicates and injection replicates showed relatively little variation (similarity score = 957). Notably, glycan distributions for homologous regions of influenza glycoprotein variants showed low variability. Due to the multiple possible sources of variation and inherent analytical difficulties in site-specific glycan determinations, variations were individually examined for multiple factors, including differences in supplier, production batch, protease digestion, and replicate measurement. After comparing all glycosylation distributions, four distinguishable classes could be identified for the majority of sites. Finally, attempts to identify glycosylation distributions on adjacent potential N-glycosylated sites of one HA variant were made. Only the second site (NnST) was found to be occupied using two rarely used proteases in proteomics, subtilisin and esperase, both of which did selectively cleave these adjacent sites.
Collapse
Affiliation(s)
- Zachary C Goecker
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA.
| | - Meghan C Burke
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Concepcion A Remoroza
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Yi Liu
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Yuri A Mirokhin
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Sergey L Sheetlin
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Dmitrii V Tchekhovskoi
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Xiaoyu Yang
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| | - Stephen E Stein
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Gaithersburg, Maryland, USA
| |
Collapse
|
2
|
Hackett WE, Chang D, Carvalho L, Zaia J. RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein composition that occur during biological processes. BIOINFORMATICS ADVANCES 2024; 4:vbae012. [PMID: 38384861 PMCID: PMC10879752 DOI: 10.1093/bioadv/vbae012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 12/15/2023] [Accepted: 01/22/2024] [Indexed: 02/23/2024]
Abstract
Motivation Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically synthesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. Glycoproteins, accounting for approximately half of all proteins, require specialized proteomics data analysis methods due to micro- and macro-heterogeneities as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, a specialized toolset is needed to determine if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. Results We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses a permutation test to generate contextual similarity, which assesses the quality of mass spectral data and outputs a graphical demonstration of the likelihood of finding biologically significant differences in glycosylation abundance datasets. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern change. RAMZIS is validated by theoretical cases and a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using this tool, researchers will be able to rigorously define the role of glycosylation and the changes that occur during biological processes. Availability and implementation https://github.com/WillHackett22/RAMZIS.
Collapse
Affiliation(s)
| | - Deborah Chang
- Department of Biochemistry, Boston University, Boston, MA 02215, United States
| | - Luis Carvalho
- Bioinformatics Program, Boston University, Boston, MA 02215, United States
- Department of Mathematics, Boston University, Boston, MA 02215, United States
| | - Joseph Zaia
- Bioinformatics Program, Boston University, Boston, MA 02215, United States
- Department of Biochemistry, Boston University, Boston, MA 02215, United States
| |
Collapse
|
3
|
Chatterjee S, Zaia J. Proteomics-based mass spectrometry profiling of SARS-CoV-2 infection from human nasopharyngeal samples. MASS SPECTROMETRY REVIEWS 2024; 43:193-229. [PMID: 36177493 PMCID: PMC9538640 DOI: 10.1002/mas.21813] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 09/07/2022] [Accepted: 09/09/2022] [Indexed: 05/12/2023]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of the on-going global pandemic of coronavirus disease 2019 (COVID-19) that continues to pose a significant threat to public health worldwide. SARS-CoV-2 encodes four structural proteins namely membrane, nucleocapsid, spike, and envelope proteins that play essential roles in viral entry, fusion, and attachment to the host cell. Extensively glycosylated spike protein efficiently binds to the host angiotensin-converting enzyme 2 initiating viral entry and pathogenesis. Reverse transcriptase polymerase chain reaction on nasopharyngeal swab is the preferred method of sample collection and viral detection because it is a rapid, specific, and high-throughput technique. Alternate strategies such as proteomics and glycoproteomics-based mass spectrometry enable a more detailed and holistic view of the viral proteins and host-pathogen interactions and help in detection of potential disease markers. In this review, we highlight the use of mass spectrometry methods to profile the SARS-CoV-2 proteome from clinical nasopharyngeal swab samples. We also highlight the necessity for a comprehensive glycoproteomics mapping of SARS-CoV-2 from biological complex matrices to identify potential COVID-19 markers.
Collapse
Affiliation(s)
- Sayantani Chatterjee
- Department of Biochemistry, Center for Biomedical Mass SpectrometryBoston University School of MedicineBostonMassachusettsUSA
| | - Joseph Zaia
- Department of Biochemistry, Center for Biomedical Mass SpectrometryBoston University School of MedicineBostonMassachusettsUSA
- Bioinformatics ProgramBoston University School of MedicineBostonMassachusettsUSA
| |
Collapse
|
4
|
Downs M, Zaia J, Sethi MK. Mass spectrometry methods for analysis of extracellular matrix components in neurological diseases. MASS SPECTROMETRY REVIEWS 2023; 42:1848-1875. [PMID: 35719114 PMCID: PMC9763553 DOI: 10.1002/mas.21792] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 04/12/2022] [Accepted: 05/24/2022] [Indexed: 06/15/2023]
Abstract
The brain extracellular matrix (ECM) is a highly glycosylated environment and plays important roles in many processes including cell communication, growth factor binding, and scaffolding. The formation of structures such as perineuronal nets (PNNs) is critical in neuroprotection and neural plasticity, and the formation of molecular networks is dependent in part on glycans. The ECM is also implicated in the neuropathophysiology of disorders such as Alzheimer's disease (AD), Parkinson's disease (PD), and Schizophrenia (SZ). As such, it is of interest to understand both the proteomic and glycomic makeup of healthy and diseased brain ECM. Further, there is a growing need for site-specific glycoproteomic information. Over the past decade, sample preparation, mass spectrometry, and bioinformatic methods have been developed and refined to provide comprehensive information about the glycoproteome. Core ECM molecules including versican, hyaluronan and proteoglycan link proteins, and tenascin are dysregulated in AD, PD, and SZ. Glycomic changes such as differential sialylation, sulfation, and branching are also associated with neurodegeneration. A more thorough understanding of the ECM and its proteomic, glycomic, and glycoproteomic changes in brain diseases may provide pathways to new therapeutic options.
Collapse
Affiliation(s)
- Margaret Downs
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA
| | - Joseph Zaia
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA
- Bioinformatics Program, Boston University, Boston, Massachusetts, USA
| | - Manveen K Sethi
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA
| |
Collapse
|
5
|
Hollander MJ, Malaker SA, Riley NM, Perez I, Abney NM, Gray MA, Maxson JE, Cochran JR, Bertozzi CR. Mutational screens highlight glycosylation as a modulator of colony-stimulating factor 3 receptor (CSF3R) activity. J Biol Chem 2023; 299:104755. [PMID: 37116708 PMCID: PMC10245049 DOI: 10.1016/j.jbc.2023.104755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 04/21/2023] [Accepted: 04/23/2023] [Indexed: 04/30/2023] Open
Abstract
The colony-stimulating factor 3 receptor (CSF3R) controls the growth of neutrophils, the most abundant type of white blood cell. In healthy neutrophils, signaling is dependent on CSF3R binding to its ligand, CSF3. A single amino acid mutation in CSF3R, T618I, instead allows for constitutive, ligand-independent cell growth and leads to a rare type of cancer called chronic neutrophilic leukemia. However, the disease mechanism is not well understood. Here, we investigated why this threonine to isoleucine substitution is the predominant mutation in chronic neutrophilic leukemia and how it leads to uncontrolled neutrophil growth. Using protein domain mapping, we demonstrated that the single CSF3R domain containing residue 618 is sufficient for ligand-independent activity. We then applied an unbiased mutational screening strategy focused on this domain and found that activating mutations are enriched at sites normally occupied by asparagine, threonine, and serine residues-the three amino acids which are commonly glycosylated. We confirmed glycosylation at multiple CSF3R residues by mass spectrometry, including the presence of GalNAc and Gal-GalNAc glycans at WT threonine 618. Using the same approach applied to other cell surface receptors, we identified an activating mutation, S489F, in the interleukin-31 receptor alpha chain. Combined, these results suggest a role for glycosylated hotspot residues in regulating receptor signaling, mutation of which can lead to ligand-independent, uncontrolled activity and human disease.
Collapse
Affiliation(s)
- Michael J Hollander
- Department of Bioengineering, Stanford University, Stanford, California, USA; Department of Chemistry and Sarafan ChEM-H, Stanford University, Stanford, California, USA
| | - Stacy A Malaker
- Department of Chemistry and Sarafan ChEM-H, Stanford University, Stanford, California, USA
| | - Nicholas M Riley
- Department of Chemistry and Sarafan ChEM-H, Stanford University, Stanford, California, USA
| | - Idalia Perez
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Nayla M Abney
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Melissa A Gray
- Department of Chemistry and Sarafan ChEM-H, Stanford University, Stanford, California, USA
| | - Julia E Maxson
- Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon, USA
| | - Jennifer R Cochran
- Department of Bioengineering, Stanford University, Stanford, California, USA; Department of Chemical Engineering, Stanford University, Stanford, California, USA.
| | - Carolyn R Bertozzi
- Department of Chemistry and Sarafan ChEM-H, Stanford University, Stanford, California, USA; Howard Hughes Medical Institute, Stanford, California, USA.
| |
Collapse
|
6
|
Hackett WE, Chang D, Carvalho L, Zaia J. RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein structure that occur during biological processes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.30.542895. [PMID: 37398011 PMCID: PMC10312533 DOI: 10.1101/2023.05.30.542895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Motivation Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically syn-thesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. While glycoproteins account for approximately half of all proteins, their macro- and micro-heterogeneity requires specialized proteomics data analysis methods as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, this necessitated specialized statistical metrics to identify if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations. Results We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide biomedical researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses contextual similarity to assess the quality of mass spectral data and generates graphical output that demonstrates the likelihood of finding biologically significant differences in glycosylation abundance dataset. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern expression change. Herein RAMZIS approach is validated by theoretical cases and by a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using our tool, researchers will be able to rigor-ously define the role of glycosylation and the changes that occur during biological processes.
Collapse
Affiliation(s)
| | - Deborah Chang
- Department of Biochemistry, Boston University, One Silber Way, Boston 02215
| | - Luis Carvalho
- Boston University, Bioinformatics Program, One Silber Way, Boston 02215, MA, USA
- Department of Mathematics, Boston University, One Silber Way, Boston 02215
| | - Joseph Zaia
- Boston University, Bioinformatics Program, One Silber Way, Boston 02215, MA, USA
- Department of Biochemistry, Boston University, One Silber Way, Boston 02215
| |
Collapse
|
7
|
Wong TL, Mooney BP, Cavallero GJ, Guan M, Li L, Zaia J, Wan XF. Glycoproteomic Analyses of Influenza A Viruses Using timsTOF Pro MS. J Proteome Res 2023; 22:62-77. [PMID: 36480915 DOI: 10.1021/acs.jproteome.2c00469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
N-Linked glycosylation in hemagglutinin and neuraminidase glycoproteins of influenza viruses affects antigenic and receptor binding properties, and precise analyses of site-specific glycoforms in these proteins are critical in understanding the antigenic and immunogenic properties of influenza viruses. In this study, we developed a glycoproteomic approach by using a timsTOF Pro mass spectrometer (MS) to determine the abundance and heterogeneity of site-specific glycosylation for influenza glycoproteins. Compared with a Q Exactive HF MS, the timsTOF Pro MS method without the hydrophilic interaction liquid chromatography column enrichment achieved similar glycopeptide coverage and quantities but was more effective in identifying low-abundance glycopeptides. We quantified the distributions of intact site-specific glycopeptides in hemagglutinin of A/chicken/Wuxi/0405005/2013 (H7N9) and A/mute swan/Rhode Island/A00325125/2008 (H7N3). Results showed that hemagglutinin for both viruses had complex N-glycans at N22, N38, N240, and N483 but only high-mannose glycans at N411 and, however, that the type and quantities of glycans were distinct between these viruses. Collisional cross section (CCS) provided by the ion mobility spectrometry from the timsTOF Pro MS data differentiated sialylation linkages of the glycopeptides. In summary, timsTOF Pro MS method can quantify intact site-specific glycans for influenza glycoproteins without enrichment and thus facilitate influenza vaccine development and production.
Collapse
Affiliation(s)
- Tin Long Wong
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, Missouri65211, United States.,Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, Missouri65211, United States.,Bond Life Sciences Center, University of Missouri, Columbia, Missouri65211, United States
| | - Brian P Mooney
- Department of Biochemistry and Charles W. Gehrke Proteomics Center, University of Missouri, Columbia, Missouri65211, United States
| | - Gustavo J Cavallero
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts02118, United States
| | - Minhui Guan
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, Missouri65211, United States.,Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, Missouri65211, United States.,Bond Life Sciences Center, University of Missouri, Columbia, Missouri65211, United States
| | - Lei Li
- Department of Chemistry, Georgia State University, Atlanta, Georgia30302, United States
| | - Joseph Zaia
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts02118, United States
| | - Xiu-Feng Wan
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, Missouri65211, United States.,Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, Missouri65211, United States.,Bond Life Sciences Center, University of Missouri, Columbia, Missouri65211, United States.,Department of Electrical Engineering & Computer Science, College of Engineering, University of Missouri, Columbia, Missouri65211, United States
| |
Collapse
|
8
|
Chang D, Zaia J. Methods to improve quantitative glycoprotein coverage from bottom-up LC-MS data. MASS SPECTROMETRY REVIEWS 2022; 41:922-937. [PMID: 33764573 DOI: 10.1002/mas.21692] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Revised: 12/24/2020] [Accepted: 03/11/2021] [Indexed: 05/18/2023]
Abstract
Advances in mass spectrometry instrumentation, methods development, and bioinformatics have greatly improved the ease and accuracy of site-specific, quantitative glycoproteomics analysis. Data-dependent acquisition is the most popular method for identification and quantification of glycopeptides; however, complete coverage of glycosylation site glycoforms remains elusive with this method. Targeted acquisition methods improve the precision and accuracy of quantification, but at the cost of throughput and discoverability. Data-independent acquisition (DIA) holds great promise for more complete and highly quantitative site-specific glycoproteomics analysis, while maintaining the ability to discover novel glycopeptides without prior knowledge. We review additional features that can be used to increase selectivity and coverage to the DIA workflow: retention time modeling, which would simplify the interpretation of complex tandem mass spectra, and ion mobility separation, which would maximize the sampling of all precursors at a giving chromatographic retention time. The instrumentation and bioinformatics to incorporate these features into glycoproteomics analysis exist. These improvements in quantitative, site-specific analysis will enable researchers to assess glycosylation similarity in related biological systems, answering new questions about the interplay between glycosylation state and biological function.
Collapse
Affiliation(s)
- Deborah Chang
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Joseph Zaia
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts, USA
| |
Collapse
|
9
|
Chang D, Klein J, Hackett WE, Nalehua MR, Wan XF, Zaia J. Improving Statistical Certainty of Glycosylation Similarity between Influenza A Virus Variants Using Data-Independent Acquisition Mass Spectrometry. Mol Cell Proteomics 2022; 21:100412. [PMID: 36103992 PMCID: PMC9593740 DOI: 10.1016/j.mcpro.2022.100412] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Revised: 08/22/2022] [Accepted: 09/08/2022] [Indexed: 01/18/2023] Open
Abstract
Amino acid sequences of immunodominant domains of hemagglutinin (HA) on the surface of influenza A virus (IAV) evolve rapidly, producing viral variants. HA mediates receptor recognition, binding and cell entry, and serves as the target for IAV vaccines. Glycosylation, a post-translational modification that places large branched polysaccharide molecules on proteins, can modulate the function of HA and shield antigenic regions allowing for viral evasion from immune responses. Our previous work showed that subtle changes in the HA protein sequence can have a measurable change in glycosylation. Thus, being able to quantitatively measure glycosylation changes in variants is critical for understanding how HA function may change throughout viral evolution. Moreover, understanding quantitatively how the choice of viral expression systems affects glycosylation can help in the process of vaccine design and manufacture. Although IAV vaccines are most commonly expressed in chicken eggs, cell-based vaccines have many advantages, and the adoption of more cell-based vaccines would be an important step in mitigating seasonal influenza and protecting against future pandemics. Here, we have investigated the use of data-independent acquisition (DIA) mass spectrometry for quantitative glycoproteomics. We found that DIA improved the sensitivity of glycopeptide detection for four variants of A/Switzerland/9715293/2013 (H3N2): WT and mutant, each expressed in embryonated chicken eggs and Madin-Darby canine kidney cells. We used the Tanimoto similarity metric to quantify changes in glycosylation between WT and mutant and between egg-expressed and cell-expressed virus. Our DIA site-specific glycosylation similarity comparison of WT and mutant expressed in eggs confirmed our previous analysis while achieving greater depth of coverage. We found that sequence variations and changing viral expression systems affected distinct glycosylation sites of HA. Our methods can be applied to track glycosylation changes in circulating IAV variants to bolster genomic surveillance already being done, for a more complete understanding of IAV evolution.
Collapse
Affiliation(s)
- Deborah Chang
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts, USA
| | - Joshua Klein
- Boston University Bioinformatics Program, Boston University, Boston, Massachusetts, USA
| | - William E Hackett
- Boston University Bioinformatics Program, Boston University, Boston, Massachusetts, USA
| | - Mary Rachel Nalehua
- Boston University Bioinformatics Program, Boston University, Boston, Massachusetts, USA
| | - Xiu-Feng Wan
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, Missouri, USA; Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, Missouri, USA; Department of Electrical Engineering & Computer Science, College of Engineering, University of Missouri, Columbia, Missouri, USA; Bond Life Sciences Center, University of Missouri, Columbia, Missouri, USA
| | - Joseph Zaia
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University School of Medicine, Boston, Massachusetts, USA; Boston University Bioinformatics Program, Boston University, Boston, Massachusetts, USA.
| |
Collapse
|
10
|
Wang Y, Tang CY, Wan XF. Antigenic characterization of influenza and SARS-CoV-2 viruses. Anal Bioanal Chem 2022; 414:2841-2881. [PMID: 34905077 PMCID: PMC8669429 DOI: 10.1007/s00216-021-03806-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 11/21/2021] [Accepted: 11/24/2021] [Indexed: 12/24/2022]
Abstract
Antigenic characterization of emerging and re-emerging viruses is necessary for the prevention of and response to outbreaks, evaluation of infection mechanisms, understanding of virus evolution, and selection of strains for vaccine development. Primary analytic methods, including enzyme-linked immunosorbent/lectin assays, hemagglutination inhibition, neuraminidase inhibition, micro-neutralization assays, and antigenic cartography, have been widely used in the field of influenza research. These techniques have been improved upon over time for increased analytical capacity, and some have been mobilized for the rapid characterization of the SARS-CoV-2 virus as well as its variants, facilitating the development of highly effective vaccines within 1 year of the initially reported outbreak. While great strides have been made for evaluating the antigenic properties of these viruses, multiple challenges prevent efficient vaccine strain selection and accurate assessment. For influenza, these barriers include the requirement for a large virus quantity to perform the assays, more than what can typically be provided by the clinical samples alone, cell- or egg-adapted mutations that can cause antigenic mismatch between the vaccine strain and circulating viruses, and up to a 6-month duration of vaccine development after vaccine strain selection, which allows viruses to continue evolving with potential for antigenic drift and, thus, antigenic mismatch between the vaccine strain and the emerging epidemic strain. SARS-CoV-2 characterization has faced similar challenges with the additional barrier of the need for facilities with high biosafety levels due to its infectious nature. In this study, we review the primary analytic methods used for antigenic characterization of influenza and SARS-CoV-2 and discuss the barriers of these methods and current developments for addressing these challenges.
Collapse
Affiliation(s)
- Yang Wang
- MU Center for Influenza and Emerging Infectious Diseases (CIEID), University of Missouri, Columbia, MO, USA
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Cynthia Y Tang
- MU Center for Influenza and Emerging Infectious Diseases (CIEID), University of Missouri, Columbia, MO, USA
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Xiu-Feng Wan
- MU Center for Influenza and Emerging Infectious Diseases (CIEID), University of Missouri, Columbia, MO, USA.
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA.
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA.
- Department of Electrical Engineering & Computer Science, College of Engineering, University of Missouri, Columbia, MO, USA.
| |
Collapse
|
11
|
Sethi MK, Downs M, Shao C, Hackett WE, Phillips JJ, Zaia J. In-Depth Matrisome and Glycoproteomic Analysis of Human Brain Glioblastoma Versus Control Tissue. Mol Cell Proteomics 2022; 21:100216. [PMID: 35202840 PMCID: PMC8957055 DOI: 10.1016/j.mcpro.2022.100216] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 02/01/2022] [Accepted: 02/03/2022] [Indexed: 12/14/2022] Open
Abstract
Glioblastoma (GBM) is the most common and malignant primary brain tumor. The extracellular matrix, also known as the matrisome, helps determine glioma invasion, adhesion, and growth. Little attention, however, has been paid to glycosylation of the extracellular matrix components that constitute the majority of glycosylated protein mass and presumed biological properties. To acquire a comprehensive understanding of the biological functions of the matrisome and its components, including proteoglycans (PGs) and glycosaminoglycans (GAGs), in GBM tumorigenesis, and to identify potential biomarker candidates, we studied the alterations of GAGs, including heparan sulfate (HS) and chondroitin sulfate (CS), the core proteins of PGs, and other glycosylated matrisomal proteins in GBM subtypes versus control human brain tissue samples. We scrutinized the proteomics data to acquire in-depth site-specific glycoproteomic profiles of the GBM subtypes that will assist in identifying specific glycosylation changes in GBM. We observed an increase in CS 6-O sulfation and a decrease in HS 6-O sulfation, accompanied by an increase in unsulfated CS and HS disaccharides in GBM versus control samples. Several core matrisome proteins, including PGs (decorin, biglycan, agrin, prolargin, glypican-1, and chondroitin sulfate proteoglycan 4), tenascin, fibronectin, hyaluronan link protein 1 and 2, laminins, and collagens, were differentially regulated in GBM versus controls. Interestingly, a higher degree of collagen hydroxyprolination was also observed for GBM versus controls. Further, two PGs, chondroitin sulfate proteoglycan 4 and agrin, were significantly lower, about 6-fold for isocitrate dehydrogenase-mutant, compared to the WT GBM samples. Differential regulation of O-glycopeptides for PGs, including brevican, neurocan, and versican, was observed for GBM subtypes versus controls. Moreover, an increase in levels of glycosyltransferase and glycosidase enzymes was observed for GBM when compared to control samples. We also report distinct protein, peptide, and glycopeptide features for GBM subtypes comparisons. Taken together, our study informs understanding of the alterations to key matrisomal molecules that occur during GBM development. (Data are available via ProteomeXchange with identifier PXD028931, and the peaks project file is available at Zenodo with DOI 10.5281/zenodo.5911810).
Collapse
Affiliation(s)
- Manveen K Sethi
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA
| | - Margaret Downs
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA
| | - Chun Shao
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA
| | - William E Hackett
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA; Bioinformatics Program, Boston University, Boston, Massachusetts, USA
| | - Joanna J Phillips
- Department of Neurological Surgery, Brain Tumor Center, Helen Diller Family Cancer Research Center, University of California San Francisco, San Francisco, California, USA; Division of Neuropathology, Department of Pathology, University of California San Francisco, San Francisco, California, USA
| | - Joseph Zaia
- Department of Biochemistry, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts, USA; Bioinformatics Program, Boston University, Boston, Massachusetts, USA.
| |
Collapse
|
12
|
Fang P, Ji Y, Oellerich T, Urlaub H, Pan KT. Strategies for Proteome-Wide Quantification of Glycosylation Macro- and Micro-Heterogeneity. Int J Mol Sci 2022; 23:ijms23031609. [PMID: 35163546 PMCID: PMC8835892 DOI: 10.3390/ijms23031609] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 01/26/2022] [Accepted: 01/27/2022] [Indexed: 12/03/2022] Open
Abstract
Protein glycosylation governs key physiological and pathological processes in human cells. Aberrant glycosylation is thus closely associated with disease progression. Mass spectrometry (MS)-based glycoproteomics has emerged as an indispensable tool for investigating glycosylation changes in biological samples with high sensitivity. Following rapid improvements in methodologies for reliable intact glycopeptide identification, site-specific quantification of glycopeptide macro- and micro-heterogeneity at the proteome scale has become an urgent need for exploring glycosylation regulations. Here, we summarize recent advances in N- and O-linked glycoproteomic quantification strategies and discuss their limitations. We further describe a strategy to propagate MS data for multilayered glycopeptide quantification, enabling a more comprehensive examination of global and site-specific glycosylation changes. Altogether, we show how quantitative glycoproteomics methods explore glycosylation regulation in human diseases and promote the discovery of biomarkers and therapeutic targets.
Collapse
Affiliation(s)
- Pan Fang
- Department of Biochemistry and Molecular Biology, School of Biology & Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou 215123, China;
| | - Yanlong Ji
- Bioanalytical Mass Spectrometry Group, Max Planck Institute for Multidisciplinary Sciences, 37077 Göttingen, Germany;
- Hematology/Oncology, Department of Medicine II, Johann Wolfgang Goethe University, 60590 Frankfurt am Main, Germany;
- Frankfurt Cancer Institute, Johann Wolfgang Goethe University, 60596 Frankfurt am Main, Germany
| | - Thomas Oellerich
- Hematology/Oncology, Department of Medicine II, Johann Wolfgang Goethe University, 60590 Frankfurt am Main, Germany;
- Frankfurt Cancer Institute, Johann Wolfgang Goethe University, 60596 Frankfurt am Main, Germany
- German Cancer Consortium (DKTK), Partner Site Frankfurt/Mainz, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Henning Urlaub
- Bioanalytical Mass Spectrometry Group, Max Planck Institute for Multidisciplinary Sciences, 37077 Göttingen, Germany;
- Institute of Clinical Chemistry, University Medical Center Göttingen, 37075 Göttingen, Germany
- Correspondence: (H.U.); (K.-T.P.)
| | - Kuan-Ting Pan
- Hematology/Oncology, Department of Medicine II, Johann Wolfgang Goethe University, 60590 Frankfurt am Main, Germany;
- Frankfurt Cancer Institute, Johann Wolfgang Goethe University, 60596 Frankfurt am Main, Germany
- Correspondence: (H.U.); (K.-T.P.)
| |
Collapse
|
13
|
Data-independent acquisition mass spectrometry for site-specific glycoproteomics characterization of SARS-CoV-2 spike protein. Anal Bioanal Chem 2021; 413:7305-7318. [PMID: 34635934 PMCID: PMC8505113 DOI: 10.1007/s00216-021-03643-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 08/23/2021] [Accepted: 08/30/2021] [Indexed: 11/21/2022]
Abstract
The spike protein of SARS-CoV-2, the virus responsible for the global pandemic of COVID-19, is an abundant, heavily glycosylated surface protein that plays a key role in receptor binding and host cell fusion, and is the focus of all current vaccine development efforts. Variants of concern are now circulating worldwide that exhibit mutations in the spike protein. Protein sequence and glycosylation variations of the spike may affect viral fitness, antigenicity, and immune evasion. Global surveillance of the virus currently involves genome sequencing, but tracking emerging variants should include quantitative measurement of changes in site-specific glycosylation as well. In this work, we used data-dependent acquisition (DDA) and data-independent acquisition (DIA) mass spectrometry to quantitatively characterize the five N-linked glycosylation sites of the glycoprotein standard alpha-1-acid glycoprotein (AGP), as well as the 22 sites of the SARS-CoV-2 spike protein. We found that DIA compared favorably to DDA in sensitivity, resulting in more assignments of low-abundance glycopeptides. However, the reproducibility across replicates of DIA-identified glycopeptides was lower than that of DDA, possibly due to the difficulty of reliably assigning low-abundance glycopeptides confidently. The differences in the data acquired between the two methods suggest that DIA outperforms DDA in terms of glycoprotein coverage but that overall performance is a balance of sensitivity, selectivity, and statistical confidence in glycoproteomics. We assert that these analytical and bioinformatics methods for assigning and quantifying glycoforms would benefit the process of tracking viral variants as well as for vaccine development.
Collapse
|
14
|
Remoroza CA, Burke MC, Liu Y, Mirokhin YA, Tchekhovskoi DV, Yang X, Stein SE. Representing and Comparing Site-Specific Glycan Abundance Distributions of Glycoproteins. J Proteome Res 2021; 20:4475-4486. [PMID: 34327998 PMCID: PMC9830564 DOI: 10.1021/acs.jproteome.1c00442] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
A method for representing and comparing distributions of N-linked glycans located at specific sites on proteins is presented. The representation takes the form of a simple mass spectrum for a given peptide sequence, with each peak corresponding to a different glycopeptide. The mass (in place of m/z) of each peak is that of the glycan mass, and its abundance corresponds to its relative abundance in the electrospray MS1 spectrum. This provides a facile means of representing all identifiable glycopeptides arising from a single protein "sequon" on a specific sequence, thereby enabling the comparison and searching of these distributions as routinely done for mass spectra. Likewise, these reference glycopeptide abundance distribution spectra (GADS) can be stored in searchable libraries. A set of such libraries created from available data is provided along with an adapted version of the widely used NIST-MS library-search software. Since GADS contain only MS1 abundances and identifications, they are equally suitable for expressing collision-induced fragmentation and electron-transfer dissociation determinations of glycopeptide identity. Comparisons of GADS for N-glycosylated sites on several proteins, especially the SARS-CoV-2 spike protein, demonstrate the potential reproducibility of GADS and their utility for comparing site-specific distributions.
Collapse
|
15
|
Wang M, Shajahan A, Pepi LE, Azadi P, Zaia J. Glycoproteomic Sample Processing, LC-MS, and Data Analysis Using GlycReSoft. Curr Protoc 2021; 1:e84. [PMID: 33761173 DOI: 10.1002/cpz1.84] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Identification of N- and O-glycosylation on specific sites of proteins, along with glycan structural information, is necessary to determine the roles glycoproteins play in normal and pathologic cellular functions. Because such glycosylation is macro- and micro-heterogeneous and alters the dissociation behavior of glycopeptides, specific sample preparation, mass spectrometry, and data analysis techniques are required. Advanced tandem mass spectrometry-based glycoproteomics coupled with powerful data mining algorithms are key elements for characterization of protein glycosylation. This article includes the detailed, streamlined sample preparation method for liquid chromatography-mass spectrometry data acquisition and subsequent bioinformatics-based data annotation using the publicly available GlycReSoft program for highly efficient identification and quantification of glycoprotein glycosylation. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Characterization of glycans and site occupancy on purified glycoprotein Support Protocol 1: In-gel digestion of glycoproteins Support Protocol 2: Detection of glycoproteins from cells/tissue through glycopeptide enrichment Basic Protocol 2: Acquisition of glycopeptides through high-resolution nano-LC-MS/MS Basic Protocol 3: Identification and quantification of glycopeptides using GlycReSoft.
Collapse
Affiliation(s)
- Meizhe Wang
- Department of Biochemistry, Boston University, Boston, Massachusetts
| | - Asif Shajahan
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia
| | - Lauren E Pepi
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia
| | - Parastoo Azadi
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia
| | - Joseph Zaia
- Department of Biochemistry, Boston University, Boston, Massachusetts.,Bioinformatics Program, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts
| |
Collapse
|
16
|
Wang M, Shajahan A, Pepi LE, Azadi P, Zaia J. Glycoproteomic Sample Processing, LC-MS, and Data Analysis Using GlycReSoft. Curr Protoc 2021. [PMID: 33761173 DOI: 10.1002/cpz1001.1084] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Identification of N- and O-glycosylation on specific sites of proteins, along with glycan structural information, is necessary to determine the roles glycoproteins play in normal and pathologic cellular functions. Because such glycosylation is macro- and micro-heterogeneous and alters the dissociation behavior of glycopeptides, specific sample preparation, mass spectrometry, and data analysis techniques are required. Advanced tandem mass spectrometry-based glycoproteomics coupled with powerful data mining algorithms are key elements for characterization of protein glycosylation. This article includes the detailed, streamlined sample preparation method for liquid chromatography-mass spectrometry data acquisition and subsequent bioinformatics-based data annotation using the publicly available GlycReSoft program for highly efficient identification and quantification of glycoprotein glycosylation. © 2021 Wiley Periodicals LLC. Basic Protocol 1: Characterization of glycans and site occupancy on purified glycoprotein Support Protocol 1: In-gel digestion of glycoproteins Support Protocol 2: Detection of glycoproteins from cells/tissue through glycopeptide enrichment Basic Protocol 2: Acquisition of glycopeptides through high-resolution nano-LC-MS/MS Basic Protocol 3: Identification and quantification of glycopeptides using GlycReSoft.
Collapse
Affiliation(s)
- Meizhe Wang
- Department of Biochemistry, Boston University, Boston, Massachusetts
| | - Asif Shajahan
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia
| | - Lauren E Pepi
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia
| | - Parastoo Azadi
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia
| | - Joseph Zaia
- Department of Biochemistry, Boston University, Boston, Massachusetts
- Bioinformatics Program, Center for Biomedical Mass Spectrometry, Boston University, Boston, Massachusetts
| |
Collapse
|
17
|
Hackett WE, Zaia J. Calculating Glycoprotein Similarities From Mass Spectrometric Data. Mol Cell Proteomics 2021; 20:100028. [PMID: 32883803 PMCID: PMC8724611 DOI: 10.1074/mcp.r120.002223] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 08/24/2020] [Accepted: 09/03/2020] [Indexed: 12/23/2022] Open
Abstract
Complex protein glycosylation occurs through biosynthetic steps in the secretory pathway that create macro- and microheterogeneity of structure and function. Required for all life forms, glycosylation diversifies and adapts protein interactions with binding partners that underpin interactions at cell surfaces and pericellular and extracellular environments. Because these biological effects arise from heterogeneity of structure and function, it is necessary to measure their changes as part of the quest to understand nature. Quite often, however, the assumption behind proteomics that posttranslational modifications are discrete additions that can be modeled using the genome as a template does not apply to protein glycosylation. Rather, it is necessary to quantify the glycosylation distribution at each glycosite and to aggregate this information into a population of mature glycoproteins that exist in a given biological system. To date, mass spectrometric methods for assigning singly glycosylated peptides are well-established. But it is necessary to quantify glycosylation heterogeneity accurately in order to gauge the alterations that occur during biological processes. The task is to quantify the glycosylated peptide forms as accurately as possible and then apply appropriate bioinformatics algorithms to the calculation of micro- and macro-similarities. In this review, we summarize current approaches for protein quantification as they apply to this glycoprotein similarity problem.
Collapse
Affiliation(s)
- William E Hackett
- Bioinformatics Program, Boston University, Boston, Massachusetts, USA
| | - Joseph Zaia
- Bioinformatics Program, Boston University, Boston, Massachusetts, USA; Department of Biochemistry, Boston University, Boston, Massachusetts, USA.
| |
Collapse
|