1
|
Wang H, Zhao L, Yu Z, Zeng X, Shi S. CoNglyPred: Accurate Prediction of N-Linked Glycosylation Sites Using ESM-2 and Structural Features With Graph Network and Co-Attention. Proteomics 2025; 25:e202400210. [PMID: 39361250 DOI: 10.1002/pmic.202400210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 08/17/2024] [Accepted: 09/20/2024] [Indexed: 03/18/2025]
Abstract
N-Linked glycosylation is crucial for various biological processes such as protein folding, immune response, and cellular transport. Traditional experimental methods for determining N-linked glycosylation sites entail substantial time and labor investment, which has led to the development of computational approaches as a more efficient alternative. However, due to the limited availability of 3D structural data, existing prediction methods often struggle to fully utilize structural information and fall short in integrating sequence and structural information effectively. Motivated by the progress of protein pretrained language models (pLMs) and the breakthrough in protein structure prediction, we introduced a high-accuracy model called CoNglyPred. Having compared various pLMs, we opt for the large-scale pLM ESM-2 to extract sequence embeddings, thus mitigating certain limitations associated with manual feature extraction. Meanwhile, our approach employs a graph transformer network to process the 3D protein structures predicted by AlphaFold2. The final graph output and ESM-2 embedding are intricately integrated through a co-attention mechanism. Among a series of comprehensive experiments on the independent test dataset, CoNglyPred outperforms state-of-the-art models and demonstrates exceptional performance in case study. In addition, we are the first to report the uncertainty of N-linked glycosylation predictors using expected calibration error and expected uncertainty calibration error.
Collapse
Affiliation(s)
- Hongmei Wang
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Long Zhao
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Ziyuan Yu
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Ximin Zeng
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Shaoping Shi
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| |
Collapse
|
2
|
van Ede JM, Soic D, Pabst M. Decoding Sugars: Mass Spectrometric Advances in the Analysis of the Sugar Alphabet. MASS SPECTROMETRY REVIEWS 2025. [PMID: 39972673 DOI: 10.1002/mas.21927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 12/18/2024] [Accepted: 01/20/2025] [Indexed: 02/21/2025]
Abstract
Monosaccharides play a central role in metabolic networks and in the biosynthesis of glycomolecules, which perform essential functions across all domains of life. Thus, identifying and quantifying these building blocks is crucial in both research and industry. Routine methods have been established to facilitate the analysis of common monosaccharides. However, despite the presence of common metabolites, most organisms utilize distinct sets of monosaccharides and derivatives. These molecules therefore display a large diversity, potentially numbering in the hundreds or thousands, with many still unknown. This complexity presents significant challenges in the study of glycomolecules, particularly in microbes, including pathogens and those with the potential to serve as novel model organisms. This review discusses mass spectrometric techniques for the isomer-sensitive analysis of monosaccharides, their derivatives, and activated forms. Although mass spectrometry allows for untargeted analysis and sensitive detection in complex matrices, the presence of stereoisomers and extensive modifications necessitates the integration of advanced chromatographic, electrophoretic, ion mobility, or ion spectroscopic methods. Furthermore, stable-isotope incorporation studies are critical in elucidating biosynthetic routes in novel organisms.
Collapse
Affiliation(s)
- Jitske M van Ede
- Department of Biotechnology, Delft University of Technology, Delft, the Netherlands
| | - Dinko Soic
- Department of Biotechnology, Delft University of Technology, Delft, the Netherlands
- Faculty of Pharmacy and Biochemistry, University of Zagreb, Zagreb, Croatia
| | - Martin Pabst
- Department of Biotechnology, Delft University of Technology, Delft, the Netherlands
| |
Collapse
|
3
|
Onigbinde S, Gutierrez Reyes CD, Sandilya V, Chukwubueze F, Oluokun O, Sahioun S, Oluokun A, Mechref Y. Optimization of glycopeptide enrichment techniques for the identification of clinical biomarkers. Expert Rev Proteomics 2024; 21:431-462. [PMID: 39439029 PMCID: PMC11877277 DOI: 10.1080/14789450.2024.2418491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 07/28/2024] [Accepted: 10/11/2024] [Indexed: 10/25/2024]
Abstract
INTRODUCTION The identification and characterization of glycopeptides through LC-MS/MS and advanced enrichment techniques are crucial for advancing clinical glycoproteomics, significantly impacting the discovery of disease biomarkers and therapeutic targets. Despite progress in enrichment methods like Lectin Affinity Chromatography (LAC), Hydrophilic Interaction Liquid Chromatography (HILIC), and Electrostatic Repulsion Hydrophilic Interaction Chromatography (ERLIC), issues with specificity, efficiency, and scalability remain, impeding thorough analysis of complex glycosylation patterns crucial for disease understanding. AREAS COVERED This review explores the current challenges and innovative solutions in glycopeptide enrichment and mass spectrometry analysis, highlighting the importance of novel materials and computational advances for improving sensitivity and specificity. It outlines the potential future directions of these technologies in clinical glycoproteomics, emphasizing their transformative impact on medical diagnostics and therapeutic strategies. EXPERT OPINION The application of innovative materials such as Metal-Organic Frameworks (MOFs), Covalent Organic Frameworks (COFs), functional nanomaterials, and online enrichment shows promise in addressing challenges associated with glycoproteomics analysis by providing more selective and robust enrichment platforms. Moreover, the integration of artificial intelligence and machine learning is revolutionizing glycoproteomics by enhancing the processing and interpretation of extensive data from LC-MS/MS, boosting biomarker discovery, and improving predictive accuracy, thus supporting personalized medicine.
Collapse
Affiliation(s)
- Sherifdeen Onigbinde
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | | | - Vishal Sandilya
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Favour Chukwubueze
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Odunayo Oluokun
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Sarah Sahioun
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Ayobami Oluokun
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| | - Yehia Mechref
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409, United States
| |
Collapse
|
4
|
Bhardwaj S, Bulluss M, D'Aubeterre A, Derakhshani A, Penner R, Mahajan M, Mahajan VB, Dufour A. Integrating the analysis of human biopsies using post-translational modifications proteomics. Protein Sci 2024; 33:e4979. [PMID: 38533548 DOI: 10.1002/pro.4979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 03/07/2024] [Accepted: 03/16/2024] [Indexed: 03/28/2024]
Abstract
Proteome diversities and their biological functions are significantly amplified by post-translational modifications (PTMs) of proteins. Shotgun proteomics, which does not typically survey PTMs, provides an incomplete picture of the complexity of human biopsies in health and disease. Recent advances in mass spectrometry-based proteomic techniques that enrich and study PTMs are helping to uncover molecular detail from the cellular level to system-wide functions, including how the microbiome impacts human diseases. Protein heterogeneity and disease complexity are challenging factors that make it difficult to characterize and treat disease. The search for clinical biomarkers to characterize disease mechanisms and complexity related to patient diagnoses and treatment has proven challenging. Knowledge of PTMs is fundamentally lacking. Characterization of complex human samples that clarify the role of PTMs and the microbiome in human diseases will result in new discoveries. This review highlights the key role of proteomic techniques used to characterize unknown biological functions of PTMs derived from complex human biopsies. Through the integration of diverse methods used to profile PTMs, this review explores the genetic regulation of proteoforms, cells of origin expressing specific proteins, and several bioactive PTMs and their subsequent analyses by liquid chromatography and tandem mass spectrometry.
Collapse
Affiliation(s)
- Sonali Bhardwaj
- Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- McCaig Institute for Bone and Joint Health, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Mitchell Bulluss
- Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- McCaig Institute for Bone and Joint Health, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Ana D'Aubeterre
- Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- McCaig Institute for Bone and Joint Health, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Afshin Derakhshani
- Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- McCaig Institute for Bone and Joint Health, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Regan Penner
- Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- McCaig Institute for Bone and Joint Health, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - MaryAnn Mahajan
- Molecular Surgery Laboratory, Stanford University, Palo Alto, California, USA
| | - Vinit B Mahajan
- Molecular Surgery Laboratory, Stanford University, Palo Alto, California, USA
- Department of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, California, USA
| | - Antoine Dufour
- Department of Physiology and Pharmacology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Department of Biochemistry and Molecular Biology, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- McCaig Institute for Bone and Joint Health, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Hotchkiss Brain Institute, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
- Snyder Institute for Chronic Diseases, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada
| |
Collapse
|