1
|
Raj A, Aggarwal S, Singh P, Yadav AK, Dash D. PgxSAVy: A tool for comprehensive evaluation of variant peptide quality in proteogenomics - catching the (un)usual suspects. Comput Struct Biotechnol J 2024; 23:711-722. [PMID: 38292474 PMCID: PMC10825656 DOI: 10.1016/j.csbj.2023.12.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 12/19/2023] [Accepted: 12/23/2023] [Indexed: 02/01/2024] Open
Abstract
Variant peptides resulting from single nucleotide polymorphisms (SNPs) can lead to aberrant protein functions and have translational potential for disease diagnosis and personalized therapy. Variant peptides detected by proteogenomics are fraught with high number of false positives, but there is no uniform and comprehensive approach to assess variant quality across analysis pipelines. Despite class-specific FDR along with ad-hoc filters, the problem is far from solved. These protocols are typically manual and tedious, and thus not uniform across labs. We demonstrate that variant peptide rescoring, integrated with intensity, variant event information and search result features, allows better discrimination of correct variant peptides. Implemented into PgxSAVy - a tool for quality control of variant peptides, this method can tackle the high rate of false positives. PgxSAVy provides a rigorous framework for quality control and annotations of variant peptides on the basis of (i) variant quality, (ii) isobaric masses, and (iii) disease annotation. PgxSAVy demonstrated high accuracy by identifying true variants with 98.43% accuracy on simulated data. Large-scale proteogenomic reanalysis of ∼2.8 million spectra (PXD004010 and PXD001468) resulted in 12,705 variant peptide spectrum matches (PSMs), of which PgxSAVy evaluated 3028 (23.8%), 1409 (11.1%) and 8268 (65.1%) as confident, semi-confident and doubtful respectively. PgxSAVy also annotates the variants based on their pathogenicity and provides support for assisted manual validation. The analysis of proteins carrying variants can provide fine granularity in discovering important pathways. PgxSAVy will advance personalized medicine by providing a comprehensive framework for quality control and prioritization of proteogenomics variants. PgxSAVy is freely available at https://pgxsavy.igib.res.in/ as a webserver and https://github.com/anuragraj/PgxSAVy as a stand-alone tool.
Collapse
Affiliation(s)
- Anurag Raj
- G. N. Ramachandran Knowledge Centre for Genomics Informatics, CSIR – Institute of Genomics and Integrative Biology, New Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Suruchi Aggarwal
- Computational and Mathematical Biology Centre (CMBC), 3rd Milestone, Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India
- Centre for Drug Discovery (CDD), 3rd Milestone, Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India
- Centre for Microbial Research (CMR), Translational Health Science and Technology Institute, NCR Biotech Science Cluster, 3rd Milestone, Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India
| | - Prateek Singh
- G. N. Ramachandran Knowledge Centre for Genomics Informatics, CSIR – Institute of Genomics and Integrative Biology, New Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Amit Kumar Yadav
- Computational and Mathematical Biology Centre (CMBC), 3rd Milestone, Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India
- Centre for Drug Discovery (CDD), 3rd Milestone, Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India
- Centre for Microbial Research (CMR), Translational Health Science and Technology Institute, NCR Biotech Science Cluster, 3rd Milestone, Faridabad-Gurgaon Expressway, Faridabad, Haryana 121001, India
| | - Debasis Dash
- G. N. Ramachandran Knowledge Centre for Genomics Informatics, CSIR – Institute of Genomics and Integrative Biology, New Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| |
Collapse
|
2
|
Flender D, Vilenne F, Adams C, Boonen K, Valkenborg D, Baggerman G. Exploring the dynamic landscape of immunopeptidomics: Unravelling posttranslational modifications and navigating bioinformatics terrain. MASS SPECTROMETRY REVIEWS 2024. [PMID: 39152539 DOI: 10.1002/mas.21905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 07/30/2024] [Accepted: 08/01/2024] [Indexed: 08/19/2024]
Abstract
Immunopeptidomics is becoming an increasingly important field of study. The capability to identify immunopeptides with pivotal roles in the human immune system is essential to shift the current curative medicine towards personalized medicine. Throughout the years, the field has matured, giving insight into the current pitfalls. Nowadays, it is commonly accepted that generalizing shotgun proteomics workflows is malpractice because immunopeptidomics faces numerous challenges. While many of these difficulties have been addressed, the road towards the ideal workflow remains complicated. Although the presence of Posttranslational modifications (PTMs) in the immunopeptidome has been demonstrated, their identification remains highly challenging despite their significance for immunotherapies. The large number of unpredictable modifications in the immunopeptidome plays a pivotal role in the functionality and these challenges. This review provides a comprehensive overview of the current advancements in immunopeptidomics. We delve into the challenges associated with identifying PTMs within the immunopeptidome, aiming to address the current state of the field.
Collapse
Affiliation(s)
- Daniel Flender
- Centre for Proteomics, University of Antwerp, Antwerpen, Belgium
- Health Unit, VITO, Mol, Belgium
| | - Frédérique Vilenne
- Health Unit, VITO, Mol, Belgium
- Data Science Institute, University of Hasselt, Hasselt, Belgium
| | - Charlotte Adams
- Department of Computer Science, University of Antwerp, Antwerp, Belgium
| | - Kurt Boonen
- Centre for Proteomics, University of Antwerp, Antwerpen, Belgium
- ImmuneSpec, Niel, Belgium
| | - Dirk Valkenborg
- Data Science Institute, University of Hasselt, Hasselt, Belgium
| | - Geert Baggerman
- Department of Computer Science, University of Antwerp, Antwerp, Belgium
- ImmuneSpec, Niel, Belgium
| |
Collapse
|
3
|
Joyce AW, Searle BC. Computational approaches to identify sites of phosphorylation. Proteomics 2024; 24:e2300088. [PMID: 37897210 DOI: 10.1002/pmic.202300088] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 10/07/2023] [Accepted: 10/09/2023] [Indexed: 10/29/2023]
Abstract
Due to their oftentimes ambiguous nature, phosphopeptide positional isomers can present challenges in bottom-up mass spectrometry-based workflows as search engine scores alone are often not enough to confidently distinguish them. Additional scoring algorithms can remedy this by providing confidence metrics in addition to these search results, reducing ambiguity. Here we describe challenges to interpreting phosphoproteomics data and review several different approaches to determine sites of phosphorylation for both data-dependent and data-independent acquisition-based workflows. Finally, we discuss open questions regarding neutral losses, gas-phase rearrangement, and false localization rate estimation experienced by both types of acquisition workflows and best practices for managing ambiguity in phosphosite determination.
Collapse
Affiliation(s)
- Alex W Joyce
- Department of Biomedical Informatics, The Ohio State University Medical Center, Columbus, Ohio, USA
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA
| | - Brian C Searle
- Department of Biomedical Informatics, The Ohio State University Medical Center, Columbus, Ohio, USA
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio, USA
| |
Collapse
|
4
|
Kitamura N, Galligan JJ. A global view of the human post-translational modification landscape. Biochem J 2023; 480:1241-1265. [PMID: 37610048 PMCID: PMC10586784 DOI: 10.1042/bcj20220251] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 07/26/2023] [Accepted: 08/07/2023] [Indexed: 08/24/2023]
Abstract
Post-translational modifications (PTMs) provide a rapid response to stimuli, finely tuning metabolism and gene expression and maintain homeostasis. Advances in mass spectrometry over the past two decades have significantly expanded the list of known PTMs in biology and as instrumentation continues to improve, this list will surely grow. While many PTMs have been studied in detail (e.g. phosphorylation, acetylation), the vast majority lack defined mechanisms for their regulation and impact on cell fate. In this review, we will highlight the field of PTM research as it currently stands, discussing the mechanisms that dictate site specificity, analytical methods for their detection and study, and the chemical tools that can be leveraged to define PTM regulation. In addition, we will highlight the approaches needed to discover and validate novel PTMs. Lastly, this review will provide a starting point for those interested in PTM biology, providing a comprehensive list of PTMs and what is known regarding their regulation and metabolic origins.
Collapse
Affiliation(s)
- Naoya Kitamura
- Department of Pharmacology and College of Pharmacy, University of Arizona, Tucson, Arizona 85721, U.S.A
| | - James J. Galligan
- Department of Pharmacology and College of Pharmacy, University of Arizona, Tucson, Arizona 85721, U.S.A
| |
Collapse
|
5
|
Cao Y, Liu XT, Mao PZ, Chen ZL, Tarn C, Dong MQ. Comparative Analysis of Chemical Cross-Linking Mass Spectrometry Data Indicates That Protein STY Residues Rarely React with N-Hydroxysuccinimide Ester Cross-Linkers. J Proteome Res 2023; 22:2593-2607. [PMID: 37494005 DOI: 10.1021/acs.jproteome.3c00037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
When it comes to mass spectrometry data analysis for identification of peptide pairs linked by N-hydroxysuccinimide (NHS) ester cross-linkers, search engines bifurcate in their setting of cross-linkable sites. Some restrict NHS ester cross-linkable sites to lysine (K) and protein N-terminus, referred to as K only for short, whereas others additionally include serine (S), threonine (T), and tyrosine (Y) by default. Here, by setting amino acids with chemically inert side chains such as glycine (G), valine (V), and leucine (L) as cross-linkable sites, which serves as a negative control, we show that software-identified STY-cross-links are only as reliable as GVL-cross-links. This is true across different NHS ester cross-linkers including DSS, DSSO, and DSBU, and across different search engines including MeroX, xiSearch, and pLink. Using a published data set originated from synthetic peptides, we demonstrate that STY-cross-links indeed have a high false discovery rate. Further analysis revealed that depending on the data and the search engine used to analyze the data, up to 65% of the STY-cross-links identified are actually K-K cross-links of the same peptide pairs, up to 61% are actually K-mono-links, and the rest tend to contain short peptides at high risk of false identification.
Collapse
Affiliation(s)
- Yong Cao
- National Institute of Biological Sciences, Beijing, Beijing 102206, China
| | - Xin-Tong Liu
- National Institute of Biological Sciences, Beijing, Beijing 102206, China
| | - Peng-Zhi Mao
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhen-Lin Chen
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ching Tarn
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Meng-Qiu Dong
- National Institute of Biological Sciences, Beijing, Beijing 102206, China
- Tsinghua Institute of Multidisciplinary Biomedical Research, Tsinghua University, Beijing 102206, China
| |
Collapse
|
6
|
Bedran G, Gasser HC, Weke K, Wang T, Bedran D, Laird A, Battail C, Zanzotto FM, Pesquita C, Axelson H, Rajan A, Harrison DJ, Palkowski A, Pawlik M, Parys M, O'Neill JR, Brennan PM, Symeonides SN, Goodlett DR, Litchfield K, Fahraeus R, Hupp TR, Kote S, Alfaro JA. The Immunopeptidome from a Genomic Perspective: Establishing the Noncanonical Landscape of MHC Class I-Associated Peptides. Cancer Immunol Res 2023; 11:747-762. [PMID: 36961404 PMCID: PMC10236148 DOI: 10.1158/2326-6066.cir-22-0621] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 11/25/2022] [Accepted: 03/16/2023] [Indexed: 03/25/2023]
Abstract
Tumor antigens can emerge through multiple mechanisms, including translation of noncoding genomic regions. This noncanonical category of tumor antigens has recently gained attention; however, our understanding of how they recur within and between cancer types is still in its infancy. Therefore, we developed a proteogenomic pipeline based on deep learning de novo mass spectrometry (MS) to enable the discovery of noncanonical MHC class I-associated peptides (ncMAP) from noncoding regions. Considering that the emergence of tumor antigens can also involve posttranslational modifications (PTM), we included an open search component in our pipeline. Leveraging the wealth of MS-based immunopeptidomics, we analyzed data from 26 MHC class I immunopeptidomic studies across 11 different cancer types. We validated the de novo identified ncMAPs, along with the most abundant PTMs, using spectral matching and controlled their FDR to 1%. The noncanonical presentation appeared to be 5 times enriched for the A03 HLA supertype, with a projected population coverage of 55%. The data reveal an atlas of 8,601 ncMAPs with varying levels of cancer selectivity and suggest 17 cancer-selective ncMAPs as attractive therapeutic targets according to a stringent cutoff. In summary, the combination of the open-source pipeline and the atlas of ncMAPs reported herein could facilitate the identification and screening of ncMAPs as targets for T-cell therapies or vaccine development.
Collapse
Affiliation(s)
- Georges Bedran
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | | | - Kenneth Weke
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | - Tongjie Wang
- School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
| | - Dominika Bedran
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | - Alexander Laird
- Urology Department, Western General Hospital, NHS Lothian, Edinburgh, United Kingdom
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
| | - Christophe Battail
- CEA, Grenoble Alpes University, INSERM, IRIG, Biosciences and Bioengineering for Health Laboratory (BGE) - UA13 INSERM-CEA-UGA, Grenoble, France
| | | | - Catia Pesquita
- LASIGE, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
| | - Håkan Axelson
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University, Lund, Sweden
| | - Ajitha Rajan
- School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
| | - David J. Harrison
- School of Medicine, University of St Andrews, St Andrews, United Kingdom
| | - Aleksander Palkowski
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | - Maciej Pawlik
- Academic Computer Centre CYFRONET, AGH University of Science and Technology, Cracow, Poland
| | - Maciej Parys
- Royal (Dick) School of Veterinary Studies and The Roslin Institute, University of Edinburgh, Edinburgh, United Kingdom
| | - J. Robert O'Neill
- Cambridge Oesophagogastric Centre, Cambridge University Hospitals NHS Foundation Trust, Cambridge, United Kingdom
| | - Paul M. Brennan
- Translational Neurosurgery, Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Stefan N. Symeonides
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
| | - David R. Goodlett
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, Canada
- University of Victoria Genome BC Proteome Centre, Victoria, Canada
| | - Kevin Litchfield
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, United Kingdom
- Tumour Immunogenomics and Immunosurveillance Laboratory, University College London Cancer Institute, London, United Kingdom
| | - Robin Fahraeus
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
- Inserm UMRS1131, Institut de Génétique Moléculaire, Université Paris 7, Paris, France
| | - Ted R. Hupp
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
| | - Sachin Kote
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
| | - Javier A. Alfaro
- International Centre for Cancer Vaccine Science, University of Gdansk, Gdansk, Poland
- School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
- Department of Biochemistry and Microbiology, University of Victoria, Victoria, Canada
| |
Collapse
|
7
|
Sun Z, Xiao W, Li N, Chang L, Xu P, Li Y. Large-Scale Profiling of Unexpected Tryptic Cleaved Sites at Ubiquitinated Lysines. J Proteome Res 2023; 22:1245-1254. [PMID: 36877145 DOI: 10.1021/acs.jproteome.2c00748] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2023]
Abstract
Trypsin specifically cleaves the C-terminus of lysine and arginine residues but often fails to cleave modified lysines, such as ubiquitination, therefore resulting in the uncleaved K-ε-GG peptides. Therefore, the cleaved ubiquitinated peptide identification was often regarded as false positives and discarded. Interestingly, unexpected cleavage at the K48-linked ubiquitin chain has been reported, suggesting the latent ability of trypsin to cleave ubiquitinated lysine residues. However, it remains unclear whether other trypsin-cleavable ubiquitinated sites are present. In this study, we verified the ability of trypsin in cleaving K6 and K63 besides K48 chains. The uncleaved K-ε-GG peptide was quickly and efficiently generated during trypsin digestion, whereas cleaved ones were produced with much lower efficiency. Then, the K-ε-GG antibody was proved to efficiently enrich the cleaved K-ε-GG peptides and several published large-scale ubiquitylation datasets were re-analyzed to interrogate the cleaved sequence features. In total, more than 2400 cleaved ubiquitinated peptides were identified in the K-ε-GG and UbiSite antibody-based datasets. The frequency of lysine upstream of the cleaved modified K was significantly enriched. The kinetic activity of trypsin in cleaving ubiquitinated peptides was further elucidated. We suggest that the cleaved K-ε-GG sites with high post-translational modification probability (≥0.75) should be considered as true positives in future ubiquitome analyses.
Collapse
Affiliation(s)
- Zhen Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China.,State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing 100850, P. R. China
| | - Weidi Xiao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China
| | - Naikang Li
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China.,Anhui Medical University, Hefei 230032, China.,School of Basic Medical Science, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, P. R. China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Institute of Lifeomics, Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China.,Anhui Medical University, Hefei 230032, China
| |
Collapse
|
8
|
Zeng X, Lan Y, Xiao J, Hu L, Tan L, Liang M, Wang X, Lu S, Peng T, Long F. Advances in phosphoproteomics and its application to COPD. Expert Rev Proteomics 2022; 19:311-324. [PMID: 36730079 DOI: 10.1080/14789450.2023.2176756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
INTRODUCTION Chronic obstructive pulmonary disease (COPD) was the third leading cause of global death in 2019, causing a huge economic burden to society. Therefore, it is urgent to identify specific phenotypes of COPD patients through early detection, and to promptly treat exacerbations. The field of phosphoproteomics has been a massive advancement, compelled by the developments in mass spectrometry, enrichment strategies, algorithms, and tools. Modern mass spectrometry-based phosphoproteomics allows understanding of disease pathobiology, biomarker discovery, and predicting new therapeutic modalities. AREAS COVERED In this article, we present an overview of phosphoproteomic research and strategies for enrichment and fractionation of phosphopeptides, identification of phosphorylation sites, chromatographic separation and mass spectrometry detection strategies, and the potential application of phosphorylated proteomic analysis in the diagnosis, treatment, and prognosis of COPD disease. EXPERT OPINION The role of phosphoproteomics in COPD is critical for understanding disease pathobiology, identifying potential biomarkers, and predicting new therapeutic approaches. However, the complexity of COPD requires the more comprehensive understanding that can be achieved through integrated multi-omics studies. Phosphoproteomics, as a part of these multi-omics approaches, can provide valuable insights into the underlying mechanisms of COPD.
Collapse
Affiliation(s)
- Xiaoyin Zeng
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Yanting Lan
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Jing Xiao
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Longbo Hu
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Long Tan
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Mengdi Liang
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Xufei Wang
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Shaohua Lu
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| | - Tao Peng
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China.,Guangdong South China Vaccine Co. Ltd, Guangzhou, China
| | - Fei Long
- Sino-French Hoffmann Institute, School of Basic Medical Science, State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
9
|
Liu Y, Pan X, Bao Y, Wei L, Gao Y. Many kinds of oxidized proteins are present more in the urine of the elderly. Clin Proteomics 2022; 19:22. [PMID: 35733114 PMCID: PMC9214981 DOI: 10.1186/s12014-022-09360-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Accepted: 06/08/2022] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND Many studies have shown an association between aging and oxidation. To our knowledge, there have been no studies exploring aging-related urine proteome modifications. The purpose of this study was to explore differences in global chemical modifications of urinary protein at different ages. METHODS Discovery (n=38) cohort MS data including children, young and old groups were downloaded from three published studies, and this data was analyzed using open-pFind for identifying modifications. Verification cohort human samples (n=28) including young, middle-aged, and old groups, rat samples (n=7) at three-time points after birth, adulthood, and old age were collected and processed in the laboratory simultaneously based on label-free quantification combined with pFind. RESULTS Discovery cohort: there were 28 kinds of differential oxidations in the old group that were higher than those in the young or children group in. Verification cohort: there were 17 kinds of differential oxidations of 49 oxidized proteins in the middle and old groups, which were significantly higher than those in the young group. Both oxidations and oxidized proteins distinguished different age groups well. There were also 15 kinds of differential oxidations in old age higher than others in the rat cohort. The results showed that the validation experiment was basically consistent with the results of the discovery experiment, showing that the level of oxidized proteins in urine increased significantly with age. CONCLUSIONS Our study is the first to show that oxidative proteins occur in urine and that oxidations are higher in older than younger ages. Perhaps improving the degree of excretion of oxidative protein in vivo through the kidney is helpful for maintaining the homeostasis of the body's internal environment, delaying aging and the occurrence of senile diseases.
Collapse
Affiliation(s)
- Yongtao Liu
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, 100875, China
| | - Xuanzhen Pan
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, 100875, China
| | - Yijin Bao
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, 100875, China
| | - Lilong Wei
- Clinical Laboratory, China-Japan Friendship Hospital, Beijing, 100029, China
| | - Youhe Gao
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
10
|
Hamood F, Bayer FP, Wilhelm M, Kuster B, The M. SIMSI-Transfer: Software-assisted reduction of missing values in phosphoproteomic and proteomic isobaric labeling data using tandem mass spectrum clustering. Mol Cell Proteomics 2022; 21:100238. [PMID: 35462064 PMCID: PMC9389303 DOI: 10.1016/j.mcpro.2022.100238] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 03/18/2022] [Accepted: 03/27/2022] [Indexed: 12/11/2022] Open
Abstract
Isobaric stable isotope labeling techniques such as tandem mass tags (TMTs) have become popular in proteomics because they enable the relative quantification of proteins with high precision from up to 18 samples in a single experiment. While missing values in peptide quantification are rare in a single TMT experiment, they rapidly increase when combining multiple TMT experiments. As the field moves toward analyzing ever higher numbers of samples, tools that reduce missing values also become more important for analyzing TMT datasets. To this end, we developed SIMSI-Transfer (Similarity-based Isobaric Mass Spectra 2 [MS2] Identification Transfer), a software tool that extends our previously developed software MaRaCluster (© Matthew The) by clustering similar tandem MS2 from multiple TMT experiments. SIMSI-Transfer is based on the assumption that similarity-clustered MS2 spectra represent the same peptide. Therefore, peptide identifications made by database searching in one TMT batch can be transferred to another TMT batch in which the same peptide was fragmented but not identified. To assess the validity of this approach, we tested SIMSI-Transfer on masked search engine identification results and recovered >80% of the masked identifications while controlling errors in the transfer procedure to below 1% false discovery rate. Applying SIMSI-Transfer to six published full proteome and phosphoproteome datasets from the Clinical Proteomic Tumor Analysis Consortium led to an increase of 26 to 45% of identified MS2 spectra with TMT quantifications. This significantly decreased the number of missing values across batches and, in turn, increased the number of peptides and proteins identified in all TMT batches by 43 to 56% and 13 to 16%, respectively. Spectrum clustering enables peptide identification transfer between LC–MS/MS runs. The SIMSI pipeline supports processing full proteome and phosphoproteome data. SIMSI increases the number of quantifiable PSMs by 26 to 45%. SIMSI reduces missing values in multibatch TMT labeling experiments by up to 21%.
Collapse
Affiliation(s)
- Firas Hamood
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Florian P Bayer
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Mathias Wilhelm
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany.
| | - Matthew The
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Freising, Germany.
| |
Collapse
|
11
|
Wang T, Wang G, Zhang G, Hou R, Zhou L, Tian X. Systematic analysis of the lysine malonylome in Sanghuangporus sanghuang. BMC Genomics 2021; 22:840. [PMID: 34798813 PMCID: PMC8603570 DOI: 10.1186/s12864-021-08120-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 10/22/2021] [Indexed: 01/18/2023] Open
Abstract
Background Sanghuangporus sanghuang is a well-known traditional medicinal mushroom associated with mulberry. Despite the properties of this mushroom being known for many years, the regulatory mechanisms of bioactive compound biosynthesis in this medicinal mushroom are still unclear. Lysine malonylation is a posttranslational modification that has many critical functions in various aspects of cell metabolism. However, at present we do not know its role in S. sanghuang. In this study, a global investigation of the lysine malonylome in S. sanghuang was therefore carried out. Results In total, 714 malonyl modification sites were matched to 255 different proteins. The analysis indicated that malonyl modifications were involved in a wide range of cellular functions and displayed a distinct subcellular localization. Bioinformatics analysis indicated that malonylated proteins were engaged in different metabolic pathways, including glyoxylate and dicarboxylate metabolism, glycolysis/gluconeogenesis, and the tricarboxylic acid (TCA) cycle. Notably, a total of 26 enzymes related to triterpene and polysaccharide biosynthesis were found to be malonylated, indicating an indispensable role of lysine malonylation in bioactive compound biosynthesis in S. sanghuang. Conclusions These findings suggest that malonylation is associated with many metabolic pathways, particularly the metabolism of the bioactive compounds triterpene and polysaccharide. This paper represents the first comprehensive survey of malonylation in S. sanghuang and provides important data for further study on the physiological function of lysine malonylation in S. sanghuang and other medicinal mushrooms. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-08120-0.
Collapse
Affiliation(s)
- Tong Wang
- Shandong Province Key Laboratory of Applied Mycology, Qingdao Agricultural University, Changcheng Road, No.700, Qingdao, 266109, China
| | - Guangyuan Wang
- Shandong Province Key Laboratory of Applied Mycology, Qingdao Agricultural University, Changcheng Road, No.700, Qingdao, 266109, China
| | - Guoli Zhang
- Shandong Province Key Laboratory of Applied Mycology, Qingdao Agricultural University, Changcheng Road, No.700, Qingdao, 266109, China
| | - Ranran Hou
- Shandong Province Key Laboratory of Applied Mycology, Qingdao Agricultural University, Changcheng Road, No.700, Qingdao, 266109, China
| | - Liwei Zhou
- State Key Laboratory of Mycology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xuemei Tian
- Shandong Province Key Laboratory of Applied Mycology, Qingdao Agricultural University, Changcheng Road, No.700, Qingdao, 266109, China.
| |
Collapse
|
12
|
Yi X, Liao Y, Wen B, Li K, Dou Y, Savage SR, Zhang B. caAtlas: An immunopeptidome atlas of human cancer. iScience 2021; 24:103107. [PMID: 34622160 PMCID: PMC8479791 DOI: 10.1016/j.isci.2021.103107] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 08/10/2021] [Accepted: 09/03/2021] [Indexed: 01/24/2023] Open
Abstract
Comprehensive characterization of tumor antigens is essential for the design of cancer immunotherapies, and mass spectrometry (MS)-based immunopeptidomics enables high-throughput identification of major histocompatibility complex (MHC)-bound peptide antigens in vivo. Here we construct an immunopeptidome atlas of human cancer through an extensive collection of 43 published immunopeptidomic datasets and standardized analysis of 81.6 million MS/MS spectra using an open search engine. Our analysis greatly expands the current knowledge of MHC-bound antigens, including an unprecedented characterization of post-translationally modified antigens and their cancer-association. We also perform systematic analysis of cancer-testis antigens, cancer-associated antigens, and neoantigens. We make all these data together with annotated MS/MS spectra supporting identification of each antigen in an easily browsable web portal named cancer antigen atlas (caAtlas). caAtlas provides a central resource for the selection and prioritization of MHC-bound peptides for in vitro HLA binding assay and immunogenicity testing, which will pave the way to eventual development of cancer immunotherapies. Extensive collection of 43 immunopeptidomic datasets with 1018 samples Standardized and rigorous identification of HLA-bound peptides, including PTM peptides Comprehensive annotation of CT antigens and cancer-associated antigens User-friendly data dissemination through the caAtlas web portal
Collapse
Affiliation(s)
- Xinpei Yi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
13
|
Levitsky LI, Bubis JA, Gorshkov MV, Tarasova IA. AA_stat: Intelligent profiling of in vivo and in vitro modifications from open search results. J Proteomics 2021; 248:104350. [PMID: 34389500 DOI: 10.1016/j.jprot.2021.104350] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 07/21/2021] [Accepted: 08/04/2021] [Indexed: 11/18/2022]
Abstract
Characterization of post-translational modifications is among the most challenging tasks in tandem mass spectrometry-based proteomics which has yet to find an efficient solution. The ultra-tolerant (open) database search attempts to meet this challenge. However, interpretation of the mass shifts observed in open search still requires an effective and automated solution. We have previously introduced the AA_stat tool for analysis of amino acid frequencies at different mass shifts and generation of hypotheses on unaccounted in vitro modifications. Here, we report on the new version of AA_stat, which now complements amino acid frequency statistics with a number of new features: (1) MS/MS-based localization of mass shifts and localization scoring, including shifts which are the sum of modifications; (2) inferring fixed modifications to increase method sensitivity; (3) inferring monoisotopic peak assignment errors and variable modifications based on abundant mass shift localizations to increase the yield of closed search; (4) new mass calibration algorithm to account for partial systematic shifts; (5) interactive integration of all results and a rated list of possible mass shift interpretations. With these options, we improve interpretation of open search results and demonstrate the utility of AA_stat for profiling of abundant and rare amino acid modifications. AA_stat is implemented in Python as an open-source command-line tool available at https://github.com/SimpleNumber/aa_stat. SIGNIFICANCE: Mass spectrometry-based PTM characterization has a long history, yet most of the methods rely on a priori knowledge of modifications of interest and do not provide a whole proteome modification landscape in a blind manner. The open database search is an efficient attempt to address this challenge by identifying peptides with mass shifts corresponding to possible modifications. Then, interpreting these mass shifts is required. Therefore, development of bioinformatics software for post-processing of the open search results, which is capable of detection and accurate annotation of new or unexpected modifications, from characterization of sample preparation efficiency and quality control to discovery of rare post-translational modifications, is of high importance.
Collapse
Affiliation(s)
- Lev I Levitsky
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Irina A Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia.
| |
Collapse
|
14
|
Moaddel R, Ubaida‐Mohien C, Tanaka T, Lyashkov A, Basisty N, Schilling B, Semba RD, Franceschi C, Gorospe M, Ferrucci L. Proteomics in aging research: A roadmap to clinical, translational research. Aging Cell 2021; 20:e13325. [PMID: 33730416 PMCID: PMC8045948 DOI: 10.1111/acel.13325] [Citation(s) in RCA: 83] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 12/31/2020] [Accepted: 01/18/2021] [Indexed: 02/06/2023] Open
Abstract
The identification of plasma proteins that systematically change with age and, independent of chronological age, predict accelerated decline of health is an expanding area of research. Circulating proteins are ideal translational "omics" since they are final effectors of physiological pathways and because physicians are accustomed to use information of plasma proteins as biomarkers for diagnosis, prognosis, and tracking the effectiveness of treatments. Recent technological advancements, including mass spectrometry (MS)-based proteomics, multiplexed proteomic assay using modified aptamers (SOMAscan), and Proximity Extension Assay (PEA, O-Link), have allowed for the assessment of thousands of proteins in plasma or other biological matrices, which are potentially translatable into new clinical biomarkers and provide new clues about the mechanisms by which aging is associated with health deterioration and functional decline. We carried out a detailed literature search for proteomic studies performed in different matrices (plasma, serum, urine, saliva, tissues) and species using multiple platforms. Herein, we identified 232 proteins that were age-associated across studies. Enrichment analysis of the 232 age-associated proteins revealed metabolic pathways previously connected with biological aging both in animal models and in humans, most remarkably insulin-like growth factor (IGF) signaling, mitogen-activated protein kinases (MAPK), hypoxia-inducible factor 1 (HIF1), cytokine signaling, Forkhead Box O (FOXO) metabolic pathways, folate metabolism, advance glycation end products (AGE), and receptor AGE (RAGE) metabolic pathway. Information on these age-relevant proteins, likely expanded and validated in longitudinal studies and examined in mechanistic studies, will be essential for patient stratification and the development of new treatments aimed at improving health expectancy.
Collapse
Affiliation(s)
- Ruin Moaddel
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | | | - Toshiko Tanaka
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | - Alexey Lyashkov
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | | | | | - Richard D Semba
- Wilmer Eye Institute Johns Hopkins University School of Medicine Baltimore MD USA
| | - Claudio Franceschi
- University of Bologna and IRCCS Institute of Neurological Sciences Bologna Italy
| | - Myriam Gorospe
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| | - Luigi Ferrucci
- Biomedical Research Centre National Institute on Aging, NIH Baltimore MD USA
| |
Collapse
|
15
|
Moyer TB, Parsley NC, Sadecki PW, Schug WJ, Hicks LM. Leveraging orthogonal mass spectrometry based strategies for comprehensive sequencing and characterization of ribosomal antimicrobial peptide natural products. Nat Prod Rep 2021; 38:489-509. [PMID: 32929442 PMCID: PMC7956910 DOI: 10.1039/d0np00046a] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Covering: Up to July 2020Ribosomal antimicrobial peptide (AMP) natural products, also known as ribosomally synthesized and post-translationally modified peptides (RiPPs) or host defense peptides, demonstrate potent bioactivities and impressive complexity that complicate molecular and biological characterization. Tandem mass spectrometry (MS) has rapidly accelerated bioactive peptide sequencing efforts, yet standard workflows insufficiently address intrinsic AMP diversity. Herein, orthogonal approaches to accelerate comprehensive and accurate molecular characterization without the need for prior isolation are reviewed. Chemical derivatization, proteolysis (enzymatic and chemical cleavage), multistage MS fragmentation, and separation (liquid chromatography and ion mobility) strategies can provide complementary amino acid composition and post-translational modification data to constrain sequence solutions. Examination of two complex case studies, gomesin and styelin D, highlights the practical implementation of the proposed approaches. Finally, we emphasize the importance of heterogeneous AMP peptidoforms that confer varying biological function, an area that warrants significant further development.
Collapse
Affiliation(s)
- Tessa B Moyer
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| | | | | | | | | |
Collapse
|
16
|
Schulze S, Igiraneza AB, Kösters M, Leufken J, Leidel SA, Garcia BA, Fufezan C, Pohlschroder M. Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal. J Proteome Res 2021; 20:1986-1996. [PMID: 33514075 DOI: 10.1021/acs.jproteome.0c00799] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The identification of peptide sequences and their post-translational modifications (PTMs) is a crucial step in the analysis of bottom-up proteomics data. The recent development of open modification search (OMS) engines allows virtually all PTMs to be searched for. This not only increases the number of spectra that can be matched to peptides but also greatly advances the understanding of the biological roles of PTMs through the identification, and the thereby facilitated quantification, of peptidoforms (peptide sequences and their potential PTMs). Whereas the benefits of combining results from multiple protein database search engines have been previously established, similar approaches for OMS results have been missing so far. Here we compare and combine results from three different OMS engines, demonstrating an increase in peptide spectrum matches of 8-18%. The unification of search results furthermore allows for the combined downstream processing of search results, including the mapping to potential PTMs. Finally, we test for the ability of OMS engines to identify glycosylated peptides. The implementation of these engines in the Python framework Ursgal facilitates the straightforward application of the OMS with unified parameters and results files, thereby enabling yet unmatched high-throughput, large-scale data analysis.
Collapse
Affiliation(s)
- Stefan Schulze
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Aime Bienfait Igiraneza
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Manuel Kösters
- Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland
| | - Johannes Leufken
- Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland
| | - Sebastian A Leidel
- Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland
| | - Benjamin A Garcia
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Christian Fufezan
- Institute of Pharmacy and Molecular Biotechnology, Heidelberg University, 69120 Heidelberg, Germany
| | - Mechthild Pohlschroder
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
17
|
Montaño KJ, Loukas A, Sotillo J. Proteomic approaches to drive advances in helminth extracellular vesicle research. Mol Immunol 2021; 131:1-5. [PMID: 33440289 DOI: 10.1016/j.molimm.2020.12.030] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 12/14/2020] [Accepted: 12/21/2020] [Indexed: 12/17/2022]
Abstract
Helminths can interact with their hosts in many different ways, including through the secretion of soluble molecules (such as lipids, glycans and proteins) and extracellular vesicles (EVs). The field of helminth secreted EVs has significantly advanced in recent years, mainly due to the molecular characterisation of EV proteomes and research highlighting the potential of EVs and their constituent molecules in the diagnosis and control of parasitic infections. Despite these advancements, the lack of appropriate isolation and purification methods is impeding the discovery of suitable biomarkers for the differentiation of helminth EV populations. In the present review we offer our viewpoint on the different proteomic techniques and approaches that have been developed, as well as solutions to common pitfalls and challenges that could be applied to advance the study of helminth EVs.
Collapse
Affiliation(s)
- Karen J Montaño
- Centro Nacional de Microbiología, Instituto de Salud Carlos III, Majadahonda, Madrid, Spain
| | - Alex Loukas
- Centre for Molecular Therapeutics, Australian Institute of Tropical Health and Medicine, James Cook University, Cairns, Australia
| | - Javier Sotillo
- Centro Nacional de Microbiología, Instituto de Salud Carlos III, Majadahonda, Madrid, Spain.
| |
Collapse
|
18
|
Geiszler DJ, Kong AT, Avtonomov DM, Yu F, Leprevost FDV, Nesvizhskii AI. PTM-Shepherd: Analysis and Summarization of Post-Translational and Chemical Modifications From Open Search Results. Mol Cell Proteomics 2020; 20:100018. [PMID: 33568339 PMCID: PMC7950090 DOI: 10.1074/mcp.tir120.002216] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 11/13/2020] [Accepted: 12/01/2020] [Indexed: 01/17/2023] Open
Abstract
Open searching has proven to be an effective strategy for identifying both known and unknown modifications in shotgun proteomics experiments. Rather than being limited to a small set of user-specified modifications, open searches identify peptides with any mass shift that may correspond to a single modification or a combination of several modifications. Here we present PTM-Shepherd, a bioinformatics tool that automates characterization of post-translational modification profiles detected in open searches based on attributes, such as amino acid localization, fragmentation spectra similarity, retention time shifts, and relative modification rates. PTM-Shepherd can also perform multiexperiment comparisons for studying changes in modification profiles, e.g., in data generated in different laboratories or under different conditions. We demonstrate how PTM-Shepherd improves the analysis of data from formalin-fixed and paraffin-embedded samples, detects extreme underalkylation of cysteine in some data sets, discovers an artifactual modification introduced during peptide synthesis, and uncovers site-specific biases in sample preparation artifacts in a multicenter proteomics profiling study.
Collapse
Affiliation(s)
- Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Andy T Kong
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Dmitry M Avtonomov
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | | | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA; Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA.
| |
Collapse
|
19
|
Yang Y, Horvatovich P, Qiao L. Fragment Mass Spectrum Prediction Facilitates Site Localization of Phosphorylation. J Proteome Res 2020; 20:634-644. [PMID: 32985198 DOI: 10.1021/acs.jproteome.0c00580] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Liquid chromatography tandem mass spectrometry (LC-MS/MS) has been the most widely used technology for phosphoproteomics studies. As an alternative to database searching and probability-based phosphorylation site localization approaches, spectral library searching has been proved to be effective in the identification of phosphopeptides. However, incompletion of experimental spectral libraries limits the identification capability. Herein, we utilize MS/MS spectrum prediction coupled with spectral matching for site localization of phosphopeptides. In silico MS/MS spectra are generated from peptide sequences by deep learning/machine learning models trained with nonphosphopeptides. Then, mass shift according to phosphorylation sites, phosphoric acid neutral loss, and a "budding" strategy are adopted to adjust the in silico mass spectra. In silico MS/MS spectra can also be generated in one step for phosphopeptides using models trained with phosphopeptides. The method is benchmarked on data sets of synthetic phosphopeptides and is used to process real biological samples. It is demonstrated to be a method requiring only computational resources that supplements the probability-based approaches for phosphorylation site localization of singly and multiply phosphorylated peptides.
Collapse
Affiliation(s)
- Yi Yang
- Department of Chemistry and Shanghai Stomatological Hospital, Fudan University, Handan Road 220, Shanghai 200000, China
| | - Peter Horvatovich
- Department of Pharmacy, University of Groningen, Antonius Deusinglaan 1, Groningen 9700 AD, The Netherlands
| | - Liang Qiao
- Department of Chemistry and Shanghai Stomatological Hospital, Fudan University, Handan Road 220, Shanghai 200000, China
| |
Collapse
|
20
|
Liu Y, Pan X, Zhao M, Gao Y. Global chemical modifications comparison of human plasma proteome from two different age groups. Sci Rep 2020; 10:14998. [PMID: 32929118 PMCID: PMC7490693 DOI: 10.1038/s41598-020-72196-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 08/17/2020] [Indexed: 11/09/2022] Open
Abstract
In this study, two groups of human plasma proteome at different age groups (old and young) were used to perform a comparison of global chemical modifications, as determined by tandem mass spectrometry (MS/MS) combined with non-limiting modification identification algorithms. The sulfhydryl in the cysteine A total of 4 molecular modifications were found to have significant differences passing random grouping tests: the succinylation and phosphorylation modification of cysteine (Cys, C) and the modification of lysine (Lys, K) with threonine (Thr, T) were significantly higher in the old group than in the young group, while the carbamylation of lysine was lower in the young group. We speculate that there is an increase in certain modified proteins in the blood of the old people which, in turn, changes the function of those proteins. This change may be one of the reasons why old people are more likely than young people to be at risk for age-related diseases, such as metabolic diseases, cerebral and cardiovascular diseases, and cancer.
Collapse
Affiliation(s)
- Yongtao Liu
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, China
| | - Xuanzhen Pan
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, China
| | - Mindi Zhao
- Department of Laboratory Medicine, National Geriatrics Center, Beijing Hospital, Beijing, China
| | - Youhe Gao
- Department of Biochemistry and Molecular Biology, Beijing Key Laboratory of Gene Engineering Drug and Biotechnology, Beijing Normal University, Beijing, China.
| |
Collapse
|
21
|
Ming L, Zou Y, Zhao Y, Zhang L, He N, Chen Z, Li SSC, Li L. MMS2plot: An R Package for Visualizing Multiple MS/MS Spectra for Groups of Modified and Non-Modified Peptides. Proteomics 2020; 20:e2000061. [PMID: 32643287 DOI: 10.1002/pmic.202000061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 06/06/2020] [Indexed: 11/11/2022]
Abstract
A large number of post-translational modifications (PTMs) in proteins are buried in the unassigned mass spectrometric (MS) spectra in shot-gun proteomics datasets. Because the modified peptide fragments are low in abundance relative to the corresponding non-modified versions, it is critical to develop tools that allow facile evaluation of assignment of PTMs based on the MS/MS spectra. Such tools will preferably have the ability to allow comparison of fragment ion spectra and retention time between the modified and unmodified peptide pairs or group. Herein, MMS2plot, an R package for visualizing peptide-spectrum matches (PSMs) for multiple peptides, is described. MMS2plot features a batch mode and generates the output images in vector graphics file format that facilitate evaluation and publication of the PSM assignment. MMS2plot is expected to play an important role in PTM discovery from large-scale proteomics datasets generated by liquid chromatography-MS/MS. The MMS2plot package is freely available at https://github.com/lileir/MMS2plot under the GPL-3 license.
Collapse
Affiliation(s)
- Liya Ming
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Yang Zou
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Yiming Zhao
- Data Science and Software Engineering, Qingdao University, Qingdao, 266021, China
| | - Luna Zhang
- Data Science and Software Engineering, Qingdao University, Qingdao, 266021, China
| | - Ningning He
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Zhen Chen
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
| | - Shawn S-C Li
- Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London, N6A 5C1, Canada
| | - Lei Li
- School of Basic Medicine, Qingdao University, Qingdao, 266021, China
- Data Science and Software Engineering, Qingdao University, Qingdao, 266021, China
| |
Collapse
|
22
|
Xu F, Yu L, Peng X, Zhang J, Li S, Liu S, Yin Y, An Z, Wang F, Fu Y, Xu P. Unambiguous Phosphosite Localization through the Combination of Trypsin and LysargiNase Mirror Spectra in a Large-Scale Phosphoproteome Study. J Proteome Res 2020; 19:2185-2194. [PMID: 32388983 DOI: 10.1021/acs.jproteome.9b00562] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Understanding of the kinase-guided signaling pathways requires the identification and analysis of phosphosites. Mass spectrometry (MS)-based phosphoproteomics is a rapid and highly sensitive approach for high-throughput identification of phosphosites. However, phosphosite determination from MS data with a single protease is more likely to be ambiguous, regardless of the strategy used for phosphopeptide detection. Here, we explored the application of LysargiNase, which was recently reported to mirror trypsin in specificity to cleave arginine and lysine residues exclusively at the N-terminal side. We found that the combination of trypsin and LysargiNase mirror spectra resulted in higher ion coverage in MS2 spectra. The median ion coverage values of b ions in tryptic spectra, LysargiNase spectra, and combined spectra are 8.3, 20.5, and 25.0%, respectively. As for the median ion coverage of y ions, these values are 27.8, 10.0, and 32.3%. Higher ion coverage was helpful to pinpoint the precise phosphosites. Compared to trypsin alone, the combined use of trypsin and LysargiNase mirror spectra enabled 67.1% of mirror spectra with unreliable scores (confidence score <0.75) to become reliable (confidence score ≥ 0.75). Meanwhile, all of the mirror peptide-spectrum matches (PSMs) with multiple potential phosphosites from trypsin and LysargiNase digests could be assigned one precise phosphosite after applying the combination strategy. Besides, the combination strategy could identify more novel phosphosites than the union strategy did. We synthesized three phosphopeptides corresponding to the three novel phosphosites and validated the reliability of the identification. Taken together, our data demonstrated the distinctive potential of the combination strategy presented here for unambiguous phosphosite localization (Project accession PXD011178).
Collapse
Affiliation(s)
- Feng Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Li Yu
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100864, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xuehui Peng
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, School of Medicine, Wuhan University, Wuhan430072, China
| | - Junling Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Suzhen Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China.,Anhui Medical University, Hefei 230032, China
| | - Shu Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yanan Yin
- Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, School of Medicine, Wuhan University, Wuhan430072, China
| | - Zhiwu An
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100864, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Fuqiang Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yan Fu
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100864, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences Beijing, Beijing Institute of Lifeomics, Beijing 102206, China.,Anhui Medical University, Hefei 230032, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, School of Medicine, Wuhan University, Wuhan430072, China
| |
Collapse
|
23
|
Yi X, Gong F, Fu Y. Transfer posterior error probability estimation for peptide identification. BMC Bioinformatics 2020; 21:173. [PMID: 32366221 PMCID: PMC7199311 DOI: 10.1186/s12859-020-3485-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 04/08/2020] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND In shotgun proteomics, database searching of tandem mass spectra results in a great number of peptide-spectrum matches (PSMs), many of which are false positives. Quality control of PSMs is a multiple hypothesis testing problem, and the false discovery rate (FDR) or the posterior error probability (PEP) is the commonly used statistical confidence measure. PEP, also called local FDR, can evaluate the confidence of individual PSMs and thus is more desirable than FDR, which evaluates the global confidence of a collection of PSMs. Estimation of PEP can be achieved by decomposing the null and alternative distributions of PSM scores as long as the given data is sufficient. However, in many proteomic studies, only a group (subset) of PSMs, e.g. those with specific post-translational modifications, are of interest. The group can be very small, making the direct PEP estimation by the group data inaccurate, especially for the high-score area where the score threshold is taken. Using the whole set of PSMs to estimate the group PEP is inappropriate either, because the null and/or alternative distributions of the group can be very different from those of combined scores. RESULTS The transfer PEP algorithm is proposed to more accurately estimate the PEPs of peptide identifications in small groups. Transfer PEP derives the group null distribution through its empirical relationship with the combined null distribution, and estimates the group alternative distribution, as well as the null proportion, using an iterative semi-parametric method. Validated on both simulated data and real proteomic data, transfer PEP showed remarkably higher accuracy than the direct combined and separate PEP estimation methods. CONCLUSIONS We presented a novel approach to group PEP estimation for small groups and implemented it for the peptide identification problem in proteomics. The methodology of the approach is in principle applicable to the small-group PEP estimation problems in other fields.
Collapse
Affiliation(s)
- Xinpei Yi
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Fuzhou Gong
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China. .,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| | - Yan Fu
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China. .,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
24
|
Shu Q, Li M, Shu L, An Z, Wang J, Lv H, Yang M, Cai T, Hu T, Fu Y, Yang F. Large-scale Identification of N-linked Intact Glycopeptides in Human Serum using HILIC Enrichment and Spectral Library Search. Mol Cell Proteomics 2020; 19:672-689. [PMID: 32102970 PMCID: PMC7124471 DOI: 10.1074/mcp.ra119.001791] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 02/10/2020] [Indexed: 11/12/2022] Open
Abstract
Large-scale identification of N-linked intact glycopeptides by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) in human serum is challenging because of the wide dynamic range of serum protein abundances, the lack of a complete serum N-glycan database and the existence of proteoforms. In this regard, a spectral library search method was presented for the identification of N-linked intact glycopeptides from N-linked glycoproteins in human serum with target-decoy and motif-specific false discovery rate (FDR) control. Serum proteins were firstly separated into low-abundance and high-abundance proteins by acetonitrile (ACN) precipitation. After digestion, the N-linked intact glycopeptides were enriched by hydrophilic interaction liquid chromatography (HILIC) and a portion of the enriched N-linked intact glycopeptides were processed by Peptide-N-Glycosidase F (PNGase F) to generate N-linked deglycopeptides. Both N-linked intact glycopeptides and deglycopeptides were analyzed by LC-MS/MS. From N-linked deglycopeptides data sets, 764 N-linked glycoproteins, 1699 N-linked glycosites and 3328 unique N-linked deglycopeptides were identified. Four types of N-linked glycosylation motifs (NXS/T/C/V, X≠P) were used to recognize the N-linked deglycopeptides. The spectra of these N-linked deglycopeptides were utilized for N-linked deglycopeptides library construction and identification of N-linked intact glycopeptides. A database containing 739 N-glycan masses was constructed and utilized during spectral library search for the identification of N-linked intact glycopeptides. In total, 526 N-linked glycoproteins, 1036 N-linked glycosites, 22,677 N-linked intact glycopeptides and 738 N-glycan masses were identified under 1% FDR, representing the most in-depth serum N-glycoproteome identified by LC-MS/MS at N-linked intact glycopeptide level.
Collapse
Affiliation(s)
- Qingbo Shu
- Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112
| | - Mengjie Li
- Computer Network Information Center, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112
| | - Lian Shu
- Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhiwu An
- Computer Network Information Center, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112
| | - Jifeng Wang
- Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Hao Lv
- Computer Network Information Center, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112; Research Center for Basic Sciences of Medicine, Basic Medical College, Guizhou Medical University, Guiyang 550025, China
| | - Ming Yang
- Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112
| | - Tanxi Cai
- Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112
| | - Tony Hu
- National Center for Mathematics and Interdisciplinary Sciences, Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
| | - Yan Fu
- Computer Network Information Center, Chinese Academy of Sciences, Beijing 100101, China; Center for Cellular and Molecular Diagnostics, Department of Biochemistry and Molecular Biology, School of Medicine, Tulane University, New Orleans, Louisiana 70112.
| | - Fuquan Yang
- Laboratory of Protein and Peptide Pharmaceuticals & Proteomics Laboratory, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
25
|
Paik YK, Overall CM, Corrales F, Deutsch EW, Lane L, Omenn GS. Advances in Identifying and Characterizing the Human Proteome. J Proteome Res 2019; 18:4079-4084. [PMID: 31805768 DOI: 10.1021/acs.jproteome.9b00745] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Affiliation(s)
- Young-Ki Paik
- Yonsei Proteome Research Center, College of Life Science and Technology , Yonsei University
| | - Christopher M Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences and Biochemistry & Molecular Biology, Faculty of Dentistry , University of British Columbia
| | - Fernando Corrales
- Functional Proteomics Laboratory National Center of Biotechnology , CSIC
| | | | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, Faculty of Medicine, CMU , University of Geneva
| | - Gilbert S Omenn
- Institute for Systems Biology, Departments of Computational Medicine & Bioinformatics, Internal Medicine, and Human Genetics & School of Public Health , University of Michigan
| |
Collapse
|
26
|
Shteynberg DD, Deutsch EW, Campbell DS, Hoopmann MR, Kusebauch U, Lee D, Mendoza L, Midha MK, Sun Z, Whetton AD, Moritz RL. PTMProphet: Fast and Accurate Mass Modification Localization for the Trans-Proteomic Pipeline. J Proteome Res 2019; 18:4262-4272. [PMID: 31290668 PMCID: PMC6898736 DOI: 10.1021/acs.jproteome.9b00205] [Citation(s) in RCA: 96] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Spectral matching sequence database search engines commonly used on mass spectrometry-based proteomics experiments excel at identifying peptide sequence ions, and in addition, possible sequence ions carrying post-translational modifications (PTMs), but most do not provide confidence metrics for the exact localization of those PTMs when several possible sites are available. Localization is absolutely required for downstream molecular cell biology analysis of PTM function in vitro and in vivo. Therefore, we developed PTMProphet, a free and open-source software tool integrated into the Trans-Proteomic Pipeline, which reanalyzes identified spectra from any search engine for which pepXML output is available to provide localization confidence to enable appropriate further characterization of biologic events. Localization of any type of mass modification (e.g., phosphorylation) is supported. PTMProphet applies Bayesian mixture models to compute probabilities for each site/peptide spectrum match where a PTM has been identified. These probabilities can be combined to compute a global false localization rate at any threshold to guide downstream analysis. We describe the PTMProphet tool, its underlying algorithms, and demonstrate its performance on ground-truth synthetic peptide reference data sets, one previously published small data set, one new larger data set, and also on a previously published phosphoenriched data set where the correct sites of modification are unknown. Data have been deposited to ProteomeXchange with identifier PXD013210.
Collapse
Affiliation(s)
| | | | | | | | | | - Dave Lee
- Stoller Biomarker Discovery Centre, University of Manchester, Manchester, M13 9PL, UK
| | - Luis Mendoza
- Institute for Systems Biology, Seattle, WA, 98008, USA
| | | | - Zhi Sun
- Institute for Systems Biology, Seattle, WA, 98008, USA
| | - Anthony D. Whetton
- Stoller Biomarker Discovery Centre, University of Manchester, Manchester, M13 9PL, UK
| | | |
Collapse
|
27
|
Using the tools of proteomics to understand the pathogenesis of idiopathic inflammatory myopathies. Curr Opin Rheumatol 2019; 31:617-622. [PMID: 31385878 DOI: 10.1097/bor.0000000000000643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
PURPOSE OF REVIEW One of the most important advances in medical research over the past 20 years has been the emergence of technologies to assess complex biological processes on a global scale. Although a great deal of attention has been given to genome-scale genetics and genomics technologies, the utility of studying the proteome in a comprehensive way is sometimes under-appreciated. In this review, we discuss recent advances in proteomics as applied to dermatomyositis/polymyositis as well as findings from other inflammatory diseases that may enlighten our understanding of dermatomyositis/polymyositis. RECENT FINDINGS Proteomic approaches have been used to investigate basic mechanisms contributing to lung and skin disease in dermatomyositis/polymyositis as well as to the muscle disease itself. In addition, proteomic approaches have been used to identify autoantibodies targeting the endothelium in juvenile dermatomyositis. Studies from other inflammatory diseases have shown the promise of using proteomics to characterize the composition of immune complexes and the protein cargoes of exosomes. SUMMARY There are many relevant scientific and clinical questions in dermatomyositis/polymyositis that can be addressed using proteomics approaches. Careful attention to both methodology and analytic approaches are required to obtain useful and reproducible data.
Collapse
|