1
|
Jadav R, Weiland F, Noordermeer SM, Carroll T, Gao Y, Wang J, Zhou H, Lamoliatte F, Toth R, Macartney T, Brown F, Hastie CJ, Alabert C, van Attikum H, Zenke F, Masson JY, Rouse J. Chemo-phosphoproteomic profiling with ATR inhibitors berzosertib and gartisertib uncovers new biomarkers and DNA damage response regulators. Mol Cell Proteomics 2024:100802. [PMID: 38880245 DOI: 10.1016/j.mcpro.2024.100802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 06/04/2024] [Accepted: 06/13/2024] [Indexed: 06/18/2024] Open
Abstract
The ATR kinase protects cells against DNA damage and replication stress and represents a promising anti-cancer drug target. The ATR inhibitors (ATRi) berzosertib and gartisertib are both in clinical trials for the treatment of advanced solid tumours as monotherapy or in combination with genotoxic agents. We carried out quantitative phospho-proteomic screening for ATR biomarkers that are highly sensitive to berzosertib and gartisertib, using an optimized mass spectrometry pipeline. Screening identified a range of novel ATR-dependent phosphorylation events, which were grouped into three broad classes: i) targets whose phosphorylation is highly sensitive to ATRi and which could be the next generation of ATR biomarkers; ii) proteins with known genome maintenance roles not previously known to be regulated by ATR; iii) novel targets whose cellular roles are unclear. Class iii targets represent candidate DNA damage response proteins and, with this in mind, proteins in this class were subjected to secondary screening for recruitment to DNA damage sites. We show that one of the proteins recruited, SCAF1, interacts with RNAPII in a phospho-dependent manner and recruitment requires PARP activity and interaction with RNAPII. We also show that SCAF1 deficiency partly rescues RAD51 loading in cells lacking the BRCA1 tumour suppressor. Taken together these data reveal potential new ATR biomarkers and new genome maintenance factors.
Collapse
Affiliation(s)
- Rathan Jadav
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Florian Weiland
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Sylvie M Noordermeer
- Dept of Human Genetics, Leiden University Medical Center, Einthovenweg 20, 2333 ZC Leiden, Netherlands; Oncode institute, Utrecht, The Netherlands
| | - Thomas Carroll
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Yuandi Gao
- CHU de Quebec Research Center, Oncology Division, Dept. of Molecular Biology, Medical Biochemistry and Pathology, Laval University Cancer Research Center, 9 McMahon Drive, Quebec Cit, QC G1R 3S3, Canada
| | - Jianming Wang
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Houjiang Zhou
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Frederic Lamoliatte
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Rachel Toth
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Thomas Macartney
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Fiona Brown
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - C James Hastie
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Constance Alabert
- Division of Molecular, Cell and Developmental Biology, School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK
| | - Haico van Attikum
- CHU de Quebec Research Center, Oncology Division, Dept. of Molecular Biology, Medical Biochemistry and Pathology, Laval University Cancer Research Center, 9 McMahon Drive, Quebec Cit, QC G1R 3S3, Canada
| | - Frank Zenke
- EMD Serono, Research Unit Oncology, Billerica, MA, USA
| | - Jean-Yves Masson
- CHU de Quebec Research Center, Oncology Division, Dept. of Molecular Biology, Medical Biochemistry and Pathology, Laval University Cancer Research Center, 9 McMahon Drive, Quebec Cit, QC G1R 3S3, Canada
| | - John Rouse
- MRC Protein Phosphorylation and Ubiquitylation Unit and School of Life Sciences, Wellcome Trust Biocentre, University of Dundee, DD1 5EH, UK.
| |
Collapse
|
2
|
Li K, Teo GC, Yang KL, Yu F, Nesvizhskii AI. diaTracer enables spectrum-centric analysis of diaPASEF proteomics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.25.595875. [PMID: 38854051 PMCID: PMC11160675 DOI: 10.1101/2024.05.25.595875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Data-independent acquisition (DIA) has become a widely used strategy for peptide and protein quantification in mass spectrometry-based proteomics studies. The integration of ion mobility separation into DIA analysis, such as the diaPASEF technology available on Bruker's timsTOF platform, further improves the quantification accuracy and protein depth achievable using DIA. We introduce diaTracer, a new spectrum-centric computational tool optimized for diaPASEF data. diaTracer performs three-dimensional (m/z, retention time, ion mobility) peak tracing and feature detection to generate precursor-resolved "pseudo-MS/MS" spectra, facilitating direct ("spectral-library free") peptide identification and quantification from diaPASEF data. diaTracer is available as a stand-alone tool and is fully integrated into the widely used FragPipe computational platform. We demonstrate the performance of diaTracer and FragPipe using diaPASEF data from cerebrospinal fluid (CSF) and plasma samples, data from phosphoproteomics and HLA immunopeptidomics experiments, and low-input data from a spatial proteomics study. We also show that diaTracer enables unrestricted identification of post-translational modifications from diaPASEF data using open/mass offset searches.
Collapse
Affiliation(s)
- Kai Li
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Kevin L. Yang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I. Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
3
|
Thesbjerg MN, Poulsen KO, Astono J, Poulsen NA, Larsen LB, Nielsen SDH, Stensballe A, Sundekilde UK. O-linked glycosylations in human milk casein and major whey proteins during lactation. Int J Biol Macromol 2024; 267:131613. [PMID: 38642686 DOI: 10.1016/j.ijbiomac.2024.131613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 04/10/2024] [Accepted: 04/13/2024] [Indexed: 04/22/2024]
Abstract
As glycosylations are difficult to analyze, their roles and effects are poorly understood. Glycosylations in human milk (HM) differ across lactation. Glycosylations can be involved in antimicrobial activities and may serve as food for beneficial microorganisms. This study aimed to identify and analyze O-linked glycans in HM by high-throughput mass spectrometry. 184 longitudinal HM samples from 66 donors from day 3 and months 1, 2, and 3 postpartum were subjected to a post-translational modification specific enrichment-based strategy using TiO2 and ZrO2 beads for O-linked glycopeptide enrichment. β-CN was found to be a major O-linked glycoprotein, additionally, αS1-CN, κ-CN, lactotransferrin, and albumin also contained O-linked glycans. As glycosyltransferases and glycosidases are involved in assembling the glycans including O-linked glycosylations, these were further investigated. Some glycosyltransferases and glycosidases were found to be significantly decreasing through lactation, including two O-linked glycan initiator enzymes (GLNT1 and GLNT2). Despite their decrease, the overall level of O-linked glycans remained stable in HM over lactation. Three different motifs for O-linked glycosylation were enriched in HM proteins: Gly-Xxx-Xxx-Gly-Ser/Thr, Arg-Ser/Thr and Lys-Ser/Thr. Further O-linked glycan motifs on β-CN were observed to differ between intact proteins and endogenous peptides in HM.
Collapse
Affiliation(s)
- Martin Nørmark Thesbjerg
- Department of Food Science, Aarhus University, Agro Food Park 48, DK-8200 Aarhus N, Denmark; Sino-Danish College (SDC), University of Chinese Academy of Science, Huairou District, Beijing 101408, China.
| | - Katrine Overgaard Poulsen
- Department of Food Science, Aarhus University, Agro Food Park 48, DK-8200 Aarhus N, Denmark; Sino-Danish College (SDC), University of Chinese Academy of Science, Huairou District, Beijing 101408, China
| | - Julie Astono
- Department of Food Science, Aarhus University, Agro Food Park 48, DK-8200 Aarhus N, Denmark
| | - Nina Aagaard Poulsen
- Department of Food Science, Aarhus University, Agro Food Park 48, DK-8200 Aarhus N, Denmark
| | - Lotte Bach Larsen
- Department of Food Science, Aarhus University, Agro Food Park 48, DK-8200 Aarhus N, Denmark
| | | | - Allan Stensballe
- Department of Health Science and Technology, Aalborg University, Selma Lagerløfsvej 249, DK-9260 Gistrup, Denmark; Clinical cancer center, Aalborg University Hospital, 9000 Aalborg, Denmark
| | | |
Collapse
|
4
|
Freestone J, Noble WS, Keich U. Analysis of Tandem Mass Spectrometry Data with CONGA: Combining Open and Narrow Searches with Group-Wise Analysis. J Proteome Res 2024. [PMID: 38652578 DOI: 10.1021/acs.jproteome.3c00399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
Searching for tandem mass spectrometry proteomics data against a database is a well-established method for assigning peptide sequences to observed spectra but typically cannot identify peptides harboring unexpected post-translational modifications (PTMs). Open modification searching aims to address this problem by allowing a spectrum to match a peptide even if the spectrum's precursor mass differs from the peptide mass. However, expanding the search space in this way can lead to a loss of statistical power to detect peptides. We therefore developed a method, called CONGA (combining open and narrow searches with group-wise analysis), that takes into account results from both types of searches─a traditional "narrow window" search and an open modification search─while carrying out rigorous false discovery rate control. The result is an algorithm that provides the best of both worlds: the ability to detect unexpected PTMs without a concomitant loss of power to detect unmodified peptides.
Collapse
Affiliation(s)
- Jack Freestone
- School of Mathematics and Statistics F07, University of Sydney, NSW 2006, Australia
| | - William S Noble
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Uri Keich
- School of Mathematics and Statistics F07, University of Sydney, NSW 2006, Australia
| |
Collapse
|
5
|
Burton NR, Backus KM. Functionalizing tandem mass tags for streamlining click-based quantitative chemoproteomics. Commun Chem 2024; 7:80. [PMID: 38600184 PMCID: PMC11006884 DOI: 10.1038/s42004-024-01162-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 03/27/2024] [Indexed: 04/12/2024] Open
Abstract
Mapping the ligandability or potential druggability of all proteins in the human proteome is a central goal of mass spectrometry-based covalent chemoproteomics. Achieving this ambitious objective requires high throughput and high coverage sample preparation and liquid chromatography-tandem mass spectrometry analysis for hundreds to thousands of reactive compounds and chemical probes. Conducting chemoproteomic screens at this scale benefits from technical innovations that achieve increased sample throughput. Here we realize this vision by establishing the silane-based cleavable linkers for isotopically-labeled proteomics-tandem mass tag (sCIP-TMT) proteomic platform, which is distinguished by early sample pooling that increases sample preparation throughput. sCIP-TMT pairs a custom click-compatible sCIP capture reagent that is readily functionalized in high yield with commercially available TMT reagents. Synthesis and benchmarking of a 10-plex set of sCIP-TMT reveal a substantial decrease in sample preparation time together with high coverage and high accuracy quantification. By screening a focused set of four cysteine-reactive electrophiles, we demonstrate the utility of sCIP-TMT for chemoproteomic target hunting, identifying 789 total liganded cysteines. Distinguished by its compatibility with established enrichment and quantification protocols, we expect sCIP-TMT will readily translate to a wide range of covalent chemoproteomic applications.
Collapse
Affiliation(s)
- Nikolas R Burton
- Department of Biological Chemistry, David Geffen School of Medicine, UCLA, Los Angeles CA, USA
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, USA
| | - Keriann M Backus
- Department of Biological Chemistry, David Geffen School of Medicine, UCLA, Los Angeles CA, USA.
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, USA.
- Molecular Biology Institute, UCLA, Los Angeles, CA, USA.
- DOE Institute for Genomics and Proteomics, UCLA, Los Angeles, CA, USA.
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, USA.
- Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA, USA.
| |
Collapse
|
6
|
Zhang NH, Deutsch EW. SpectiCal: m/ z Calibration of MS2 Peptide Spectra Using Known Low Mass Ions. J Proteome Res 2024; 23:1519-1530. [PMID: 38538550 DOI: 10.1021/acs.jproteome.3c00882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/06/2024]
Abstract
Most tandem mass spectrometry fragmentation spectra have small calibration errors that can lead to suboptimal interpretation and annotation. We developed SpectiCal, a software tool that can read mzML files from data-dependent acquisition proteomics experiments in parallel, compute m/z calibrations for each file prior to identification analysis based on known low-mass ions, and produce information about frequently observed peaks and their explanations. Using calibration coefficients, the data can be corrected to generate new calibrated mzML files. SpectiCal was tested using five public data sets, creating a table of commonly observed low-mass ions and their identifications. Information about the calibration and individual peaks is written in PDF and TSV files. This includes information for each peak, such as the number of runs in which it appears, the percentage of spectra in which it appears, and a plot of the aggregated region surrounding each peak. SpectiCal can be used to compute MS run calibrations, examine MS runs for artifacts that might hinder downstream analysis, and generate tables of detected low-mass ions for further analysis. SpectiCal is freely available at https://github.com/PlantProteomes/SpectiCal.
Collapse
Affiliation(s)
- Nathan H Zhang
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| |
Collapse
|
7
|
Ferreira HJ, Stevenson BJ, Pak H, Yu F, Almeida Oliveira J, Huber F, Taillandier-Coindard M, Michaux J, Ricart-Altimiras E, Kraemer AI, Kandalaft LE, Speiser DE, Nesvizhskii AI, Müller M, Bassani-Sternberg M. Immunopeptidomics-based identification of naturally presented non-canonical circRNA-derived peptides. Nat Commun 2024; 15:2357. [PMID: 38490980 PMCID: PMC10943130 DOI: 10.1038/s41467-024-46408-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 02/16/2024] [Indexed: 03/18/2024] Open
Abstract
Circular RNAs (circRNAs) are covalently closed non-coding RNAs lacking the 5' cap and the poly-A tail. Nevertheless, it has been demonstrated that certain circRNAs can undergo active translation. Therefore, aberrantly expressed circRNAs in human cancers could be an unexplored source of tumor-specific antigens, potentially mediating anti-tumor T cell responses. This study presents an immunopeptidomics workflow with a specific focus on generating a circRNA-specific protein fasta reference. The main goal of this workflow is to streamline the process of identifying and validating human leukocyte antigen (HLA) bound peptides potentially originating from circRNAs. We increase the analytical stringency of our workflow by retaining peptides identified independently by two mass spectrometry search engines and/or by applying a group-specific FDR for canonical-derived and circRNA-derived peptides. A subset of circRNA-derived peptides specifically encoded by the region spanning the back-splice junction (BSJ) are validated with targeted MS, and with direct Sanger sequencing of the respective source transcripts. Our workflow identifies 54 unique BSJ-spanning circRNA-derived peptides in the immunopeptidome of melanoma and lung cancer samples. Our approach enlarges the catalog of source proteins that can be explored for immunotherapy.
Collapse
Affiliation(s)
- Humberto J Ferreira
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Brian J Stevenson
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - HuiSong Pak
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Jessica Almeida Oliveira
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Florian Huber
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Marie Taillandier-Coindard
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Justine Michaux
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Emma Ricart-Altimiras
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Anne I Kraemer
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| | - Lana E Kandalaft
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- Center of Experimental Therapeutics, Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Daniel E Speiser
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Markus Müller
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland.
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland.
- Agora Cancer Research Centre, Lausanne, Switzerland.
- Center of Experimental Therapeutics, Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland.
| |
Collapse
|
8
|
Yi X, Wen B, Ji S, Saltzman AB, Jaehnig EJ, Lei JT, Gao Q, Zhang B. Deep Learning Prediction Boosts Phosphoproteomics-Based Discoveries Through Improved Phosphopeptide Identification. Mol Cell Proteomics 2024; 23:100707. [PMID: 38154692 PMCID: PMC10831110 DOI: 10.1016/j.mcpro.2023.100707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 11/06/2023] [Accepted: 12/23/2023] [Indexed: 12/30/2023] Open
Abstract
Shotgun phosphoproteomics enables high-throughput analysis of phosphopeptides in biological samples. One of the primary challenges associated with this technology is the relatively low rate of phosphopeptide identification during data analysis. This limitation hampers the full realization of the potential offered by shotgun phosphoproteomics. Here we present DeepRescore2, a computational workflow that leverages deep learning-based retention time and fragment ion intensity predictions to improve phosphopeptide identification and phosphosite localization. Using a state-of-the-art computational workflow as a benchmark, DeepRescore2 increases the number of correctly identified peptide-spectrum matches by 17% in a synthetic dataset and identifies 19% to 46% more phosphopeptides in biological datasets. In a liver cancer dataset, 30% of the significantly altered phosphosites between tumor and normal tissues and 60% of the prognosis-associated phosphosites identified from DeepRescore2-processed data could not be identified based on the state-of-the-art workflow. Notably, DeepRescore2-processed data uniquely identifies EGFR hyperactivation as a new target in poor-prognosis liver cancer, which is validated experimentally. Integration of deep learning prediction in DeepRescore2 improves phosphopeptide identification and facilitates biological discoveries.
Collapse
Affiliation(s)
- Xinpei Yi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Shuyi Ji
- Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital and Key Laboratory of Carcinogenesis and Cancer Invasion of the Ministry of China, Fudan University, Shanghai, China
| | - Alexander B Saltzman
- Mass Spectrometry Proteomics Core, Advanced Technology Cores, Baylor College of Medicine, Houston, Texas, USA
| | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Jonathan T Lei
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Qiang Gao
- Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital and Key Laboratory of Carcinogenesis and Cancer Invasion of the Ministry of China, Fudan University, Shanghai, China
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA.
| |
Collapse
|
9
|
Genth J, Schäfer K, Cassidy L, Graspeuntner S, Rupp J, Tholey A. Identification of proteoforms of short open reading frame-encoded peptides in Blautia producta under different cultivation conditions. Microbiol Spectr 2023; 11:e0252823. [PMID: 37782090 PMCID: PMC10715070 DOI: 10.1128/spectrum.02528-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 08/14/2023] [Indexed: 10/03/2023] Open
Abstract
IMPORTANCE The identification of short open reading frame-encoded peptides (SEP) and different proteoforms in single cultures of gut microbes offers new insights into a largely neglected part of the microbial proteome landscape. This is of particular importance as SEP provide various predicted functions, such as acting as antimicrobial peptides, maintaining cell homeostasis under stress conditions, or even contributing to the virulence pattern. They are, thus, taking a poorly understood role in structure and function of microbial networks in the human body. A better understanding of SEP in the context of human health requires a precise understanding of the abundance of SEP both in commensal microbes as well as pathogens. For the gut beneficial B. producta, we demonstrate the importance of specific environmental conditions for biosynthesis of SEP expanding previous findings about their role in microbial interactions.
Collapse
Affiliation(s)
- Jerome Genth
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Kathrin Schäfer
- Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
| | - Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Simon Graspeuntner
- Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg-Lübeck-Borstel-Riems, Lübeck, Germany
| | - Jan Rupp
- Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg-Lübeck-Borstel-Riems, Lübeck, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| |
Collapse
|
10
|
Burton NR, Polasky DA, Shikwana F, Ofori S, Yan T, Geiszler DJ, Veiga Leprevost FD, Nesvizhskii AI, Backus KM. Solid-Phase Compatible Silane-Based Cleavable Linker Enables Custom Isobaric Quantitative Chemoproteomics. J Am Chem Soc 2023; 145:21303-21318. [PMID: 37738129 DOI: 10.1021/jacs.3c05797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/24/2023]
Abstract
Mass spectrometry-based chemoproteomics has emerged as an enabling technology for functional biology and drug discovery. To address limitations of established chemoproteomics workflows, including cumbersome reagent synthesis and low throughput sample preparation, here, we established the silane-based cleavable isotopically labeled proteomics (sCIP) method. The sCIP method is enabled by a high yielding and scalable route to dialkoxydiphenylsilane fluorenylmethyloxycarbonyl (DADPS-Fmoc)-protected amino acid building blocks, which enable the facile synthesis of customizable, isotopically labeled, and chemically cleavable biotin capture reagents. sCIP is compatible with both MS1- and MS2-based quantitation, and the sCIP-MS2 method is distinguished by its click-assembled isobaric tags in which the reporter group is encoded in the sCIP capture reagent and balancer in the pan cysteine-reactive probe. The sCIP-MS2 workflow streamlines sample preparation with early stage isobaric labeling and sample pooling, allowing for high coverage and increased sample throughput via customized low cost six-plex sample multiplexing. When paired with a custom FragPipe data analysis workflow and applied to cysteine-reactive fragment screens, sCIP proteomics revealed established and unprecedented cysteine-ligand pairs, including the discovery that mitochondrial uncoupling agent FCCP acts as a covalent-reversible cysteine-reactive electrophile.
Collapse
Affiliation(s)
- Nikolas R Burton
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Flowreen Shikwana
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Samuel Ofori
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Tianyang Yan
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California 90095, United States
| | - Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | | | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Keriann M Backus
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California 90095, United States
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, California 90095, United States
- DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, California 90095, United States
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, California 90095, United States
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, California 90095, United States
| |
Collapse
|
11
|
Lei JT, Jaehnig EJ, Smith H, Holt MV, Li X, Anurag M, Ellis MJ, Mills GB, Zhang B, Labrie M. The Breast Cancer Proteome and Precision Oncology. Cold Spring Harb Perspect Med 2023; 13:a041323. [PMID: 37137501 PMCID: PMC10547392 DOI: 10.1101/cshperspect.a041323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
The goal of precision oncology is to translate the molecular features of cancer into predictive and prognostic tests that can be used to individualize treatment leading to improved outcomes and decreased toxicity. Success for this strategy in breast cancer is exemplified by efficacy of trastuzumab in tumors overexpressing ERBB2 and endocrine therapy for tumors that are estrogen receptor positive. However, other effective treatments, including chemotherapy, immune checkpoint inhibitors, and CDK4/6 inhibitors are not associated with strong predictive biomarkers. Proteomics promises another tier of information that, when added to genomic and transcriptomic features (proteogenomics), may create new opportunities to improve both treatment precision and therapeutic hypotheses. Here, we review both mass spectrometry-based and antibody-dependent proteomics as complementary approaches. We highlight how these methods have contributed toward a more complete understanding of breast cancer and describe the potential to guide diagnosis and treatment more accurately.
Collapse
Affiliation(s)
- Jonathan T Lei
- Lester and Sue Smith Breast Center and Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Eric J Jaehnig
- Lester and Sue Smith Breast Center and Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Hannah Smith
- Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon 97239, USA
| | - Matthew V Holt
- Lester and Sue Smith Breast Center and Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Xi Li
- Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon 97239, USA
| | - Meenakshi Anurag
- Lester and Sue Smith Breast Center and Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Matthew J Ellis
- Lester and Sue Smith Breast Center and Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Gordon B Mills
- Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon 97239, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center and Dan L. Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Marilyne Labrie
- Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon 97239, USA
| |
Collapse
|
12
|
Zhang B, Bassani-Sternberg M. Current perspectives on mass spectrometry-based immunopeptidomics: the computational angle to tumor antigen discovery. J Immunother Cancer 2023; 11:e007073. [PMID: 37899131 PMCID: PMC10619091 DOI: 10.1136/jitc-2023-007073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/21/2023] [Indexed: 10/31/2023] Open
Abstract
Identification of tumor antigens presented by the human leucocyte antigen (HLA) molecules is essential for the design of effective and safe cancer immunotherapies that rely on T cell recognition and killing of tumor cells. Mass spectrometry (MS)-based immunopeptidomics enables high-throughput, direct identification of HLA-bound peptides from a variety of cell lines, tumor tissues, and healthy tissues. It involves immunoaffinity purification of HLA complexes followed by MS profiling of the extracted peptides using data-dependent acquisition, data-independent acquisition, or targeted approaches. By incorporating DNA, RNA, and ribosome sequencing data into immunopeptidomics data analysis, the proteogenomic approach provides a powerful means for identifying tumor antigens encoded within the canonical open reading frames of annotated coding genes and non-canonical tumor antigens derived from presumably non-coding regions of our genome. We discuss emerging computational challenges in immunopeptidomics data analysis and tumor antigen identification, highlighting key considerations in the proteogenomics-based approach, including accurate DNA, RNA and ribosomal sequencing data analysis, careful incorporation of predicted novel protein sequences into reference protein database, special quality control in MS data analysis due to the expanded and heterogeneous search space, cancer-specificity determination, and immunogenicity prediction. The advancements in technology and computation is continually enabling us to identify tumor antigens with higher sensitivity and accuracy, paving the way toward the development of more effective cancer immunotherapies.
Collapse
Affiliation(s)
- Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, University of Lausanne, Lausanne, Switzerland
- Department of Oncology, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
- Agora Cancer Research Centre, Lausanne, Switzerland
| |
Collapse
|
13
|
Li Y, Dou Y, Da Veiga Leprevost F, Geffen Y, Calinawan AP, Aguet F, Akiyama Y, Anand S, Birger C, Cao S, Chaudhary R, Chilappagari P, Cieslik M, Colaprico A, Zhou DC, Day C, Domagalski MJ, Esai Selvan M, Fenyö D, Foltz SM, Francis A, Gonzalez-Robles T, Gümüş ZH, Heiman D, Holck M, Hong R, Hu Y, Jaehnig EJ, Ji J, Jiang W, Katsnelson L, Ketchum KA, Klein RJ, Lei JT, Liang WW, Liao Y, Lindgren CM, Ma W, Ma L, MacCoss MJ, Martins Rodrigues F, McKerrow W, Nguyen N, Oldroyd R, Pilozzi A, Pugliese P, Reva B, Rudnick P, Ruggles KV, Rykunov D, Savage SR, Schnaubelt M, Schraink T, Shi Z, Singhal D, Song X, Storrs E, Terekhanova NV, Thangudu RR, Thiagarajan M, Wang LB, Wang JM, Wang Y, Wen B, Wu Y, Wyczalkowski MA, Xin Y, Yao L, Yi X, Zhang H, Zhang Q, Zuhl M, Getz G, Ding L, Nesvizhskii AI, Wang P, Robles AI, Zhang B, Payne SH. Proteogenomic data and resources for pan-cancer analysis. Cancer Cell 2023; 41:1397-1406. [PMID: 37582339 PMCID: PMC10506762 DOI: 10.1016/j.ccell.2023.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 11/15/2022] [Accepted: 06/27/2023] [Indexed: 08/17/2023]
Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, creating rich multi-omics datasets connecting genomic aberrations to cancer phenotypes. To facilitate pan-cancer investigations, we have generated harmonized genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts to create a cohesive and powerful dataset for scientific discovery. We outline efforts by the CPTAC pan-cancer working group in data harmonization, data dissemination, and computational resources for aiding biological discoveries. We also discuss challenges for multi-omics data integration and analysis, specifically the unique challenges of working with both nucleotide sequencing and mass spectrometry proteomics data.
Collapse
Affiliation(s)
- Yize Li
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Yifat Geffen
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Anna P Calinawan
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - François Aguet
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Yo Akiyama
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Shankara Anand
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Chet Birger
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | - Song Cao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | | | - Marcin Cieslik
- Department of Computational Medicine & Bioinformatics, Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Antonio Colaprico
- Department of Public Health Sciences, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Daniel Cui Zhou
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Corbin Day
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | | | - Myvizhi Esai Selvan
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - David Fenyö
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Steven M Foltz
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | - Tania Gonzalez-Robles
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Zeynep H Gümüş
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - David Heiman
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | | | - Runyu Hong
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Yingwei Hu
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jiayi Ji
- Tisch Cancer Institute and Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Wen Jiang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Lizabeth Katsnelson
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | | | - Robert J Klein
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jonathan T Lei
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Wen-Wei Liang
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Caleb M Lindgren
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Weiping Ma
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Lei Ma
- ICF, Rockville, MD 20850, USA
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Fernanda Martins Rodrigues
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Wilson McKerrow
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | | | - Robert Oldroyd
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | | | - Pietro Pugliese
- Department of Sciences and Technologies, University of Sannio, Benevento 82100, Italy
| | - Boris Reva
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Paul Rudnick
- Spectragen Informatics, Bainbridge Island, WA 98110, USA
| | - Kelly V Ruggles
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Dmitry Rykunov
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Michael Schnaubelt
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Tobias Schraink
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Zhiao Shi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Xiaoyu Song
- Tisch Cancer Institute and Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Erik Storrs
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Nadezhda V Terekhanova
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | | | - Liang-Bo Wang
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Joshua M Wang
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Ying Wang
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY 10016, USA; Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yige Wu
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Matthew A Wyczalkowski
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Yi Xin
- ICF, Rockville, MD 20850, USA
| | - Lijun Yao
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Xinpei Yi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Hui Zhang
- Department of Pathology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Qing Zhang
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA
| | | | - Gad Getz
- Broad Institute of MIT and Harvard, Cambridge, MA 02141, USA; Cancer Center and Department of Pathology, Mass. General Hospital, Boston, MA 02114, USA; Harvard Medical School, Boston, MA 02115, USA
| | - Li Ding
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63130, USA; Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63130, USA; Department of Genetics, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | - Pei Wang
- Department of Genetic and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Rockville, MD 20850, USA.
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo, UT 84602, USA.
| |
Collapse
|
14
|
Yang KL, Yu F, Teo GC, Li K, Demichev V, Ralser M, Nesvizhskii AI. MSBooster: improving peptide identification rates using deep learning-based features. Nat Commun 2023; 14:4539. [PMID: 37500632 PMCID: PMC10374903 DOI: 10.1038/s41467-023-40129-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 07/06/2023] [Indexed: 07/29/2023] Open
Abstract
Peptide identification in liquid chromatography-tandem mass spectrometry (LC-MS/MS) experiments relies on computational algorithms for matching acquired MS/MS spectra against sequences of candidate peptides using database search tools, such as MSFragger. Here, we present a new tool, MSBooster, for rescoring peptide-to-spectrum matches using additional features incorporating deep learning-based predictions of peptide properties, such as LC retention time, ion mobility, and MS/MS spectra. We demonstrate the utility of MSBooster, in tandem with MSFragger and Percolator, in several different workflows, including nonspecific searches (immunopeptidomics), direct identification of peptides from data independent acquisition data, single-cell proteomics, and data generated on an ion mobility separation-enabled timsTOF MS platform. MSBooster is fast, robust, and fully integrated into the widely used FragPipe computational platform.
Collapse
Affiliation(s)
- Kevin L Yang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Kai Li
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Vadim Demichev
- Department of Biochemistry, Charité Universitätsmedizin, Berlin, Germany
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Markus Ralser
- Department of Biochemistry, Charité Universitätsmedizin, Berlin, Germany
- Nuffield Department of Medicine, The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
15
|
Yu F, Teo GC, Kong AT, Fröhlich K, Li GX, Demichev V, Nesvizhskii AI. Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform. Nat Commun 2023; 14:4154. [PMID: 37438352 PMCID: PMC10338508 DOI: 10.1038/s41467-023-39869-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 06/28/2023] [Indexed: 07/14/2023] Open
Abstract
Liquid chromatography (LC) coupled with data-independent acquisition (DIA) mass spectrometry (MS) has been increasingly used in quantitative proteomics studies. Here, we present a fast and sensitive approach for direct peptide identification from DIA data, MSFragger-DIA, which leverages the unmatched speed of the fragment ion indexing-based search engine MSFragger. Different from most existing methods, MSFragger-DIA conducts a database search of the DIA tandem mass (MS/MS) spectra prior to spectral feature detection and peak tracing across the LC dimension. To streamline the analysis of DIA data and enable easy reproducibility, we integrate MSFragger-DIA into the FragPipe computational platform for seamless support of peptide identification and spectral library building from DIA, data-dependent acquisition (DDA), or both data types combined. We compare MSFragger-DIA with other DIA tools, such as DIA-Umpire based workflow in FragPipe, Spectronaut, DIA-NN library-free, and MaxDIA. We demonstrate the fast, sensitive, and accurate performance of MSFragger-DIA across a variety of sample types and data acquisition schemes, including single-cell proteomics, phosphoproteomics, and large-scale tumor proteome profiling studies.
Collapse
Affiliation(s)
- Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Andy T Kong
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Klemens Fröhlich
- Proteomics Core Facility, Biozentrum, University of Basel, Basel, Switzerland
| | - Ginny Xiaohe Li
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Vadim Demichev
- Department of Biochemistry, Charité - Universitätsmedizin Berlin, Berlin, Germany
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
16
|
Bell PA, Overall CM. No Substrate Left behind-Mining of Shotgun Proteomics Datasets Rescues Evidence of Proteolysis by SARS-CoV-2 3CL pro Main Protease. Int J Mol Sci 2023; 24:ijms24108723. [PMID: 37240067 DOI: 10.3390/ijms24108723] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 05/11/2023] [Accepted: 05/11/2023] [Indexed: 05/28/2023] Open
Abstract
Proteolytic processing is the most ubiquitous post-translational modification and regulator of protein function. To identify protease substrates, and hence the function of proteases, terminomics workflows have been developed to enrich and detect proteolytically generated protein termini from mass spectrometry data. The mining of shotgun proteomics datasets for such 'neo'-termini, to increase the understanding of proteolytic processing, is an underutilized opportunity. However, to date, this approach has been hindered by the lack of software with sufficient speed to make searching for the relatively low numbers of protease-generated semi-tryptic peptides present in non-enriched samples viable. We reanalyzed published shotgun proteomics datasets for evidence of proteolytic processing in COVID-19 using the recently upgraded MSFragger/FragPipe software, which searches data with a speed that is an order of magnitude greater than many equivalent tools. The number of protein termini identified was higher than expected and constituted around half the number of termini detected by two different N-terminomics methods. We identified neo-N- and C-termini generated during SARS-CoV-2 infection that were indicative of proteolysis and were mediated by both viral and host proteases-a number of which had been recently validated by in vitro assays. Thus, re-analyzing existing shotgun proteomics data is a valuable adjunct for terminomics research that can be readily tapped (for example, in the next pandemic where data would be scarce) to increase the understanding of protease function and virus-host interactions, or other diverse biological processes.
Collapse
Affiliation(s)
- Peter A Bell
- Department of Oral Biological and Medical Sciences, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
- Centre for Blood Research, Life Sciences Institute, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| | - Christopher M Overall
- Department of Oral Biological and Medical Sciences, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
- Centre for Blood Research, Life Sciences Institute, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
- Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, BC V6T 1Z3, Canada
| |
Collapse
|
17
|
Polasky DA, Geiszler DJ, Yu F, Li K, Teo GC, Nesvizhskii AI. MSFragger-Labile: A Flexible Method to Improve Labile PTM Analysis in Proteomics. Mol Cell Proteomics 2023; 22:100538. [PMID: 37004988 PMCID: PMC10182319 DOI: 10.1016/j.mcpro.2023.100538] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 03/24/2023] [Accepted: 03/28/2023] [Indexed: 04/03/2023] Open
Abstract
Posttranslational modifications of proteins play essential roles in defining and regulating the functions of the proteins they decorate, making identification of these modifications critical to understanding biology and disease. Methods for enriching and analyzing a wide variety of biological and chemical modifications of proteins have been developed using mass spectrometry-based proteomics, largely relying on traditional database search methods to identify the resulting mass spectra of modified peptides. These database search methods treat modifications as static attachments of a mass to particular position in the peptide sequence, but many modifications undergo fragmentation in tandem mass spectrometry experiments alongside, or instead of, the peptide backbone. While this fragmentation can confound traditional search methods, it also offers unique opportunities for improved searches that incorporate modification-specific fragment ions. Here, we present a new labile mode in the MSFragger search engine that provides the flexibility to tailor modification-centric searches to the fragmentation observed. We show that labile mode can dramatically improve spectrum identification rates of phosphopeptides, RNA-crosslinked peptides, and ADP-ribosylated peptides. Each of these modifications presents distinct fragmentation characteristics, showcasing the flexibility of MSFragger labile mode to improve search for a wide variety of biological and chemical modifications.
Collapse
Affiliation(s)
- Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA.
| | - Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Kai Li
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.
| |
Collapse
|
18
|
Wen B, Zhang B. PepQuery2 democratizes public MS proteomics data for rapid peptide searching. Nat Commun 2023; 14:2213. [PMID: 37072382 PMCID: PMC10113256 DOI: 10.1038/s41467-023-37462-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 03/17/2023] [Indexed: 04/20/2023] Open
Abstract
We present PepQuery2, which leverages a new tandem mass spectrometry (MS/MS) data indexing approach to enable ultrafast, targeted identification of novel and known peptides in any local or publicly available MS proteomics datasets. The stand-alone version of PepQuery2 allows directly searching more than one billion indexed MS/MS spectra in the PepQueryDB or any public datasets from PRIDE, MassIVE, iProX, or jPOSTrepo, whereas the web version enables users to search datasets in PepQueryDB with a user-friendly interface. We demonstrate the utilities of PepQuery2 in a wide range of applications including detecting proteomic evidence for genomically predicted novel peptides, validating novel and known peptides identified using spectrum-centric database searching, prioritizing tumor-specific antigens, identifying missing proteins, and selecting proteotypic peptides for targeted proteomics experiments. By putting public MS proteomics data directly into the hands of scientists, PepQuery2 opens many new ways to transform these data into useful information for the broad research community.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
19
|
Wang P, Wu X, Shi Z, Tao S, Liu Z, Qi K, Xie Z, Qiao X, Gu C, Yin H, Cheng M, Gu X, Liu X, Tang C, Cao P, Xu S, Zhou B, Gu T, Bian Y, Wu J, Zhang S. A large-scale proteogenomic atlas of pear. MOLECULAR PLANT 2023; 16:599-615. [PMID: 36733253 DOI: 10.1016/j.molp.2023.01.011] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 01/10/2023] [Accepted: 01/30/2023] [Indexed: 06/18/2023]
Abstract
Pear is an important fruit tree that is widely distributed around the world. The first pear genome map was reported from our laboratory approximately 10 years ago. To further study global protein expression patterns in pear, we generated pear proteome data based on 24 major tissues. The tissue-resolved profiles provided evidence of the expression of 17 953 proteins. We identified 4294 new coding events and improved the pear genome annotation via the proteogenomic strategy based on 18 090 peptide spectra with peptide spectrum matches >1. Among the eight randomly selected new short coding open reading frames that were expressed in the style, four promoted and one inhibited the growth of pear pollen tubes. Based on gene coexpression module analysis, we explored the key genes associated with important agronomic traits, such as stone cell formation in fruits. The network regulating the synthesis of lignin, a major component of stone cells, was reconstructed, and receptor-like kinases were implicated as core factors in this regulatory network. Moreover, we constructed the online database PearEXP (http://www.peardb.org.cn) to enable access to the pear proteogenomic resources. This study provides a paradigm for in-depth proteogenomic studies of woody plants.
Collapse
Affiliation(s)
- Peng Wang
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiao Wu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Zebin Shi
- Institute of Horticulture, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Shutian Tao
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Zhe Liu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Kaijie Qi
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Zhihua Xie
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Xin Qiao
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Chao Gu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Hao Yin
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Mengyu Cheng
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Xiaoyu Gu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Xueying Liu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Chao Tang
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Peng Cao
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | | | | | - Tingting Gu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China
| | - Yangyang Bian
- College of Life Sciences, Northwest University, Xi'an 710127, China
| | - Juyou Wu
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China.
| | - Shaoling Zhang
- Sanya Institute of Nanjing Agricultural University, National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing 210095, China.
| |
Collapse
|
20
|
Post-translational modifications reshape the antigenic landscape of the MHC I immunopeptidome in tumors. Nat Biotechnol 2023; 41:239-251. [PMID: 36203013 DOI: 10.1038/s41587-022-01464-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 08/09/2022] [Indexed: 11/08/2022]
Abstract
Post-translational modification (PTM) of antigens provides an additional source of specificities targeted by immune responses to tumors or pathogens, but identifying antigen PTMs and assessing their role in shaping the immunopeptidome is challenging. Here we describe the Protein Modification Integrated Search Engine (PROMISE), an antigen discovery pipeline that enables the analysis of 29 different PTM combinations from multiple clinical cohorts and cell lines. We expanded the antigen landscape, uncovering human leukocyte antigen class I binding motifs defined by specific PTMs with haplotype-specific binding preferences and revealing disease-specific modified targets, including thousands of new cancer-specific antigens that can be shared between patients and across cancer types. Furthermore, we uncovered a subset of modified peptides that are specific to cancer tissue and driven by post-translational changes that occurred in the tumor proteome. Our findings highlight principles of PTM-driven antigenicity, which may have broad implications for T cell-mediated therapies in cancer and beyond.
Collapse
|
21
|
Yi X, Wen B, Ji S, Saltzman A, Jaehnig EJ, Lei JT, Gao Q, Zhang B. Deep learning prediction boosts phosphoproteomics-based discoveries through improved phosphopeptide identification. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.11.523329. [PMID: 36711982 PMCID: PMC9882090 DOI: 10.1101/2023.01.11.523329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Shotgun phosphoproteomics enables high-throughput analysis of phosphopeptides in biological samples, but low phosphopeptide identification rate in data analysis limits the potential of this technology. Here we present DeepRescore2, a computational workflow that leverages deep learning-based retention time and fragment ion intensity predictions to improve phosphopeptide identification and phosphosite localization. Using a state-of-the-art computational workflow as a benchmark, DeepRescore2 increases the number of correctly identified peptide-spectrum matches by 17% in a synthetic dataset and identifies 19%-46% more phosphopeptides in biological datasets. In a liver cancer dataset, 30% of the significantly altered phosphosites between tumor and normal tissues and 60% of the prognosis-associated phosphosites identified from DeepRescore2-processed data could not be identified based on the state-of-the-art workflow. Notably, DeepRescore2-processed data uniquely identifies EGFR hyperactivation as a new target in poor-prognosis liver cancer, which is validated experimentally. Integration of deep learning prediction in DeepRescore2 improves phosphopeptide identification and facilitates biological discoveries.
Collapse
|
22
|
Vašíček J, Skiadopoulou D, Kuznetsova KG, Wen B, Johansson S, Njølstad PR, Bruckner S, Käll L, Vaudel M. Finding haplotypic signatures in proteins. Gigascience 2022; 12:giad093. [PMID: 37919975 PMCID: PMC10622322 DOI: 10.1093/gigascience/giad093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 09/24/2023] [Accepted: 10/08/2023] [Indexed: 11/04/2023] Open
Abstract
BACKGROUND The nonrandom distribution of alleles of common genomic variants produces haplotypes, which are fundamental in medical and population genetic studies. Consequently, protein-coding genes with different co-occurring sets of alleles can encode different amino acid sequences: protein haplotypes. These protein haplotypes are present in biological samples and detectable by mass spectrometry, but they are not accounted for in proteomic searches. Consequently, the impact of haplotypic variation on the results of proteomic searches and the discoverability of peptides specific to haplotypes remain unknown. FINDINGS Here, we study how common genetic haplotypes influence the proteomic search space and investigate the possibility to match peptides containing multiple amino acid substitutions to a publicly available data set of mass spectra. We found that for 12.42% of the discoverable amino acid substitutions encoded by common haplotypes, 2 or more substitutions may co-occur in the same peptide after tryptic digestion of the protein haplotypes. We identified 352 spectra that matched to such multivariant peptides, and out of the 4,582 amino acid substitutions identified, 6.37% were covered by multivariant peptides. However, the evaluation of the reliability of these matches remains challenging, suggesting that refined error rate estimation procedures are needed for such complex proteomic searches. CONCLUSIONS As these procedures become available and the ability to analyze protein haplotypes increases, we anticipate that proteomics will provide new information on the consequences of common variation, across tissues and time.
Collapse
Affiliation(s)
- Jakub Vašíček
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen 5021, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5008, Norway
| | - Dafni Skiadopoulou
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen 5021, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5008, Norway
| | - Ksenia G Kuznetsova
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen 5021, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5008, Norway
| | - Bo Wen
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, United States
| | - Stefan Johansson
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen 5021, Norway
- Department of Medical Genetics, Haukeland University Hospital, Bergen 5021, Norway
| | - Pål R Njølstad
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen 5021, Norway
- Children and Youth Clinic, Haukeland University Hospital, Bergen 5021, Norway
| | - Stefan Bruckner
- Chair of Visual Analytics, Institute for Visual and Analytic Computing, University of Rostock, Rostock 18051, Germany
| | - Lukas Käll
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH–Royal Institute of Technology, Solna 17121, Sweden
| | - Marc Vaudel
- Mohn Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, Bergen 5021, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5008, Norway
- Department of Genetics and Bioinformatics, Health Data and Digitalization, Norwegian Institute of Public Health, Oslo 0473, Norway
| |
Collapse
|
23
|
Proteomic Profiling of Intra-Islet Features Reveals Substructure-Specific Protein Signatures. Mol Cell Proteomics 2022; 21:100426. [PMID: 36244662 PMCID: PMC9706166 DOI: 10.1016/j.mcpro.2022.100426] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 08/30/2022] [Accepted: 09/04/2022] [Indexed: 11/11/2022] Open
Abstract
Despite their diminutive size, islets of Langerhans play a large role in maintaining systemic energy balance in the body. New technologies have enabled us to go from studying the whole pancreas to isolated whole islets, to partial islet sections, and now to islet substructures isolated from within the islet. Using a microfluidic nanodroplet-based proteomics platform coupled with laser capture microdissection and field asymmetric waveform ion mobility spectrometry, we present an in-depth investigation of protein profiles specific to features within the islet. These features include the islet-acinar interface vascular tissue, inner islet vasculature, isolated endocrine cells, whole islet with vasculature, and acinar tissue from around the islet. Compared to interface vasculature, unique protein signatures observed in the inner vasculature indicate increased innervation and intra-islet neuron-like crosstalk. We also demonstrate the utility of these data for identifying localized structure-specific drug-target interactions using existing protein/drug binding databases.
Collapse
|
24
|
Yang Z, Wang Y, Liu S, Deng W, Lomeli SH, Moriceau G, Wohlschlegel J, Piva M, Lo RS. Enhancing PD-L1 Degradation by ITCH during MAPK Inhibitor Therapy Suppresses Acquired Resistance. Cancer Discov 2022; 12:1942-1959. [PMID: 35638972 PMCID: PMC9357203 DOI: 10.1158/2159-8290.cd-21-1463] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 04/23/2022] [Accepted: 05/25/2022] [Indexed: 01/09/2023]
Abstract
MAPK inhibitor (MAPKi) therapy in melanoma leads to the accumulation of tumor-surface PD-L1/L2, which may evade antitumor immunity and accelerate acquired resistance. Here, we discover that the E3 ligase ITCH binds, ubiquitinates, and downregulates tumor-surface PD-L1/L2 in MAPKi-treated human melanoma cells, thereby promoting T-cell activation. During MAPKi therapy in vivo, melanoma cell-intrinsic ITCH knockdown induced tumor-surface PD-L1, reduced intratumoral cytolytic CD8+ T cells, and accelerated acquired resistance only in immune-competent mice. Conversely, tumor cell-intrinsic ITCH overexpression reduced MAPKi-elicited PD-L1 accumulation, augmented intratumoral cytolytic CD8+ T cells, and suppressed acquired resistance in BrafV600MUT, NrasMUT, or Nf1MUT melanoma and KrasMUT-driven cancers. CD8+ T-cell depletion and tumor cell-intrinsic PD-L1 overexpression nullified the phenotype of ITCH overexpression, thereby supporting an in vivo ITCH-PD-L1-T-cell regulatory axis. Moreover, we identify a small-molecular ITCH activator that suppresses acquired MAPKi resistance in vivo. Thus, MAPKi-induced PD-L1 accelerates resistance, and a PD-L1-degrading ITCH activator prolongs antitumor response. SIGNIFICANCE MAPKi induces tumor cell-surface PD-L1 accumulation, which promotes immune evasion and therapy resistance. ITCH degrades PD-L1, optimizing antitumor T-cell immunity. We propose degrading tumor cell-surface PD-L1 and/or activating tumor-intrinsic ITCH as strategies to overcome MAPKi resistance. This article is highlighted in the In This Issue feature, p. 1825.
Collapse
Affiliation(s)
- Zhentao Yang
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Yan Wang
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Sixue Liu
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Weixian Deng
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Shirley H. Lomeli
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Gatien Moriceau
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - James Wohlschlegel
- Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Marco Piva
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Roger S. Lo
- Division of Dermatology, Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
- Department of Molecular and Medical Pharmacology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
- Jonsson Comprehensive Cancer Center, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| |
Collapse
|
25
|
NPvis: An Interactive Visualizer of Peptidic Natural Product–MS/MS Matches. Metabolites 2022; 12:metabo12080706. [PMID: 36005578 PMCID: PMC9415073 DOI: 10.3390/metabo12080706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 07/22/2022] [Accepted: 07/27/2022] [Indexed: 11/16/2022] Open
Abstract
Peptidic natural products (PNPs) represent a medically important class of secondary metabolites that includes antibiotics, anti-inflammatory and antitumor agents. Advances in tandem mass spectra (MS/MS) acquisition and in silico database search methods have enabled high-throughput PNP discovery. However, the resulting spectra annotations are often error-prone and their validation remains a bottleneck. Here, we present NPvis, a visualizer suitable for the evaluation of PNP–MS/MS matches. The tool interactively maps annotated spectrum peaks to the corresponding PNP fragments and allows researchers to assess the match correctness. NPvis accounts for the wide chemical diversity of PNPs that prevents the use of the existing proteomics visualizers. Moreover, NPvis works even if the exact chemical structure of the matching PNP is unknown. The tool is available online and as a standalone application. We hope that it will benefit the community by streamlining PNP data analysis and validation.
Collapse
|
26
|
Xiang H, Zhang L, Bu F, Guan X, Chen L, Zhang H, Zhao Y, Chen H, Zhang W, Li Y, Lee LJ, Mei Z, Rao Y, Gu Y, Hou Y, Mu F, Dong X. A Novel Proteogenomic Integration Strategy Expands the Breadth of Neo-Epitope Sources. Cancers (Basel) 2022; 14:cancers14123016. [PMID: 35740681 PMCID: PMC9220843 DOI: 10.3390/cancers14123016] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Revised: 06/09/2022] [Accepted: 06/13/2022] [Indexed: 11/16/2022] Open
Abstract
Tumor-specific antigens can activate T cell-based antitumor immune responses and are ideal targets for cancer immunotherapy. However, their identification is still challenging. Although mass spectrometry can directly identify human leukocyte antigen (HLA) binding peptides in tumor cells, it focuses on tumor-specific antigens derived from annotated protein-coding regions constituting only 1.5% of the genome. We developed a novel proteogenomic integration strategy to expand the breadth of tumor-specific epitopes derived from all genomic regions. Using the colorectal cancer cell line HCT116 as a model, we accurately identified 10,737 HLA-presented peptides, 1293 of which were non-canonical peptides that traditional database searches could not identify. Moreover, we found eight tumor neo-epitopes derived from somatic mutations, four of which were not previously reported. Our findings suggest that this new proteogenomic approach holds great promise for increasing the number of tumor-specific antigen candidates, potentially enlarging the tumor target pool and improving cancer immunotherapy.
Collapse
Affiliation(s)
- Haitao Xiang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; (H.X.); (X.G.); (W.Z.); (Y.L.)
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Le Zhang
- BGI-GenoImmune, BGI-Shenzhen, Shenzhen 518083, China; (L.Z.); (L.J.L.)
| | - Fanyu Bu
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Xiangyu Guan
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; (H.X.); (X.G.); (W.Z.); (Y.L.)
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Lei Chen
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Haibo Zhang
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Yuntong Zhao
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Huanyi Chen
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Weicong Zhang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; (H.X.); (X.G.); (W.Z.); (Y.L.)
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
| | - Yijian Li
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; (H.X.); (X.G.); (W.Z.); (Y.L.)
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
- Guangdong Provincial Key Laboratory of Human Disease Genomics, Shenzhen Key Laboratory of Genomics, Shenzhen 518083, China
| | - Leo Jingyu Lee
- BGI-GenoImmune, BGI-Shenzhen, Shenzhen 518083, China; (L.Z.); (L.J.L.)
| | - Zhanlong Mei
- BGI, Shenzhen 518083, China; (Z.M.); (Y.R.); (Y.H.)
| | - Yuan Rao
- BGI, Shenzhen 518083, China; (Z.M.); (Y.R.); (Y.H.)
| | - Ying Gu
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen 518120, China
| | - Yong Hou
- BGI, Shenzhen 518083, China; (Z.M.); (Y.R.); (Y.H.)
| | - Feng Mu
- BGI, Shenzhen 518083, China; (Z.M.); (Y.R.); (Y.H.)
- Correspondence: (F.M.); (X.D.)
| | - Xuan Dong
- BGI-Shenzhen, Shenzhen 518103, China; (F.B.); (L.C.); (H.Z.); (Y.Z.); (H.C.); (Y.G.)
- Guangdong Provincial Key Laboratory of Human Disease Genomics, Shenzhen Key Laboratory of Genomics, Shenzhen 518083, China
- Correspondence: (F.M.); (X.D.)
| |
Collapse
|
27
|
IntroSpect: Motif-Guided Immunopeptidome Database Building Tool to Improve the Sensitivity of HLA I Binding Peptide Identification by Mass Spectrometry. Biomolecules 2022; 12:biom12040579. [PMID: 35454168 PMCID: PMC9025654 DOI: 10.3390/biom12040579] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 04/11/2022] [Accepted: 04/12/2022] [Indexed: 01/02/2023] Open
Abstract
Although database search tools originally developed for shotgun proteome have been widely used in immunopeptidomic mass spectrometry identifications, they have been reported to achieve undesirably low sensitivities or high false positive rates as a result of the hugely inflated search space caused by the lack of specific enzymic digestions in immunopeptidome. To overcome such a problem, we developed a motif-guided immunopeptidome database building tool named IntroSpect, which is designed to first learn the peptide motifs from high confidence hits in the initial search, and then build a targeted database for refined search. Evaluated on 18 representative HLA class I datasets, IntroSpect can improve the sensitivity by an average of 76%, compared to conventional searches with unspecific digestions, while maintaining a very high level of accuracy (~96%), as confirmed by synthetic validation experiments. A distinct advantage of IntroSpect is that it does not depend on any external HLA data, so that it performs equally well on both well-studied and poorly-studied HLA types, unlike the previously developed method SpectMHC. We have also designed IntroSpect to keep a global FDR that can be conveniently controlled, similar to a conventional database search. Finally, we demonstrate the practical value of IntroSpect by discovering neoepitopes from MS data directly, an important application in cancer immunotherapies. IntroSpect is freely available to download and use.
Collapse
|
28
|
Rajczewski AT, Han Q, Mehta S, Kumar P, Jagtap PD, Knutson CG, Fox JG, Tretyakova NY, Griffin TJ. Quantitative Proteogenomic Characterization of Inflamed Murine Colon Tissue Using an Integrated Discovery, Verification, and Validation Proteogenomic Workflow. Proteomes 2022; 10:proteomes10020011. [PMID: 35466239 PMCID: PMC9036229 DOI: 10.3390/proteomes10020011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 03/27/2022] [Accepted: 04/07/2022] [Indexed: 11/24/2022] Open
Abstract
Chronic inflammation of the colon causes genomic and/or transcriptomic events, which can lead to expression of non-canonical protein sequences contributing to oncogenesis. To better understand these mechanisms, Rag2−/−Il10−/− mice were infected with Helicobacter hepaticus to induce chronic inflammation of the cecum and the colon. Transcriptomic data from harvested proximal colon samples were used to generate a customized FASTA database containing non-canonical protein sequences. Using a proteogenomic approach, mass spectrometry data for proximal colon proteins were searched against this custom FASTA database using the Galaxy for Proteomics (Galaxy-P) platform. In addition to the increased abundance in inflammatory response proteins, we also discovered several non-canonical peptide sequences derived from unique proteoforms. We confirmed the veracity of these novel sequences using an automated bioinformatics verification workflow with targeted MS-based assays for peptide validation. Our bioinformatics discovery workflow identified 235 putative non-canonical peptide sequences, of which 58 were verified with high confidence and 39 were validated in targeted proteomics assays. This study provides insights into challenges faced when identifying non-canonical peptides using a proteogenomics approach and demonstrates an integrated workflow addressing these challenges. Our bioinformatic discovery and verification workflow is publicly available and accessible via the Galaxy platform and should be valuable in non-canonical peptide identification using proteogenomics.
Collapse
Affiliation(s)
- Andrew T. Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Qiyuan Han
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
| | - Charles G. Knutson
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (C.G.K.); (J.G.F.)
| | - James G. Fox
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; (C.G.K.); (J.G.F.)
| | - Natalia Y. Tretyakova
- Department of Medicinal Chemistry, the Masonic Cancer Center, University of Minnesota, Minneapolis, MN 55455, USA;
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA; (A.T.R.); (Q.H.); (S.M.); (P.K.); (P.D.J.)
- Correspondence:
| |
Collapse
|
29
|
Hari PS, Balakrishnan L, Kotyada C, Everad John A, Tiwary S, Shah N, Sirdeshmukh R. Proteogenomic Analysis of Breast Cancer Transcriptomic and Proteomic Data, Using De Novo Transcript Assembly: Genome-Wide Identification of Novel Peptides and Clinical Implications. Mol Cell Proteomics 2022; 21:100220. [PMID: 35227895 PMCID: PMC9020135 DOI: 10.1016/j.mcpro.2022.100220] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 01/16/2022] [Accepted: 02/24/2022] [Indexed: 11/30/2022] Open
Abstract
We have carried out proteogenomic analysis of the breast cancer transcriptomic and proteomic data, available at The Clinical Proteomic Tumor Analysis Consortium resource, to identify novel peptides arising from alternatively spliced events as well as other noncanonical expressions. We used a pipeline that consisted of de novo transcript assembly, six frame-translated custom database, and a combination of search engines to identify novel peptides. A portfolio of 4,387 novel peptide sequences initially identified was further screened through PepQuery validation tool (Clinical Proteomic Tumor Analysis Consortium), which yielded 1,558 novel peptides. We considered the dataset of 1,558 validated through PepQuery to understand their functional and clinical significance, leaving the rest to be further verified using other validation tools and approaches. The novel peptides mapped to the known gene sequences as well as to genomic regions yet undefined for translation, 580 novel peptides mapped to known protein-coding genes, 147 to non–protein-coding genes, and 831 belonged to novel translational sequences. The novel peptides belonging to protein-coding genes represented alternatively spliced events or 5′ or 3′ extensions, whereas others represented translation from pseudogenes, long noncoding RNAs, or novel peptides originating from uncharacterized protein-coding sequences—mostly from the intronic regions of known genes. Seventy-six of the 580 protein-coding genes were associated with cancer hallmark genes, which included key oncogenes, transcription factors, kinases, and cell surface receptors. Survival association analysis of the 76 novel peptide sequences revealed 10 of them to be significant, and we present a panel of six novel peptides, whose high expression was found to be strongly associated with poor survival of patients with human epidermal growth factor receptor 2–enriched subtype. Our analysis represents a landscape of novel peptides of different types that may be expressed in breast cancer tissues, whereas their presence in full-length functional proteins needs further investigations. Novel protein variants and peptides from noncoding sequences are rapidly emerging. Mining of mass spectrometry data using proteogenomic analysis reveals such entities. Novel peptides from coding and noncoding sequences identified in breast cancer. Novel peptides mapped to cancer hallmark genes in breast cancer. Panel of novel peptides with prognostic potential found for HER2-enriched subtype.
Collapse
Affiliation(s)
- P S Hari
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India
| | - Lavanya Balakrishnan
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India
| | - Chaithanya Kotyada
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India
| | | | - Shivani Tiwary
- Simulation and Modeling Sciences, Pfizer Pharma GmBH, Berlin, Germany
| | - Nameeta Shah
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India.
| | - Ravi Sirdeshmukh
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India; Institute of Bioinformatics, International Tech Park, Bangalore, India; Health Sciences, Manipal Academy of Higher Education, Manipal, India.
| |
Collapse
|
30
|
Yan T, Palmer AB, Geiszler DJ, Polasky DA, Boatner LM, Burton NR, Armenta E, Nesvizhskii AI, Backus KM. Enhancing Cysteine Chemoproteomic Coverage through Systematic Assessment of Click Chemistry Product Fragmentation. Anal Chem 2022; 94:3800-3810. [PMID: 35195394 DOI: 10.1021/acs.analchem.1c04402] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Mass spectrometry-based chemoproteomics has enabled functional analysis and small molecule screening at thousands of cysteine residues in parallel. Widely adopted chemoproteomic sample preparation workflows rely on the use of pan cysteine-reactive probes such as iodoacetamide alkyne combined with biotinylation via copper-catalyzed azide-alkyne cycloaddition (CuAAC) or "click chemistry" for cysteine capture. Despite considerable advances in both sample preparation and analytical platforms, current techniques only sample a small fraction of all cysteines encoded in the human proteome. Extending the recently introduced labile mode of the MSFragger search engine, here we report an in-depth analysis of cysteine biotinylation via click chemistry (CBCC) reagent gas-phase fragmentation during MS/MS analysis. We find that CBCC conjugates produce both known and novel diagnostic fragments and peptide remainder ions. Among these species, we identified a candidate signature ion for CBCC peptides, the cyclic oxonium-biotin fragment ion that is generated upon fragmentation of the N(triazole)-C(alkyl) bond. Guided by our empirical comparison of fragmentation patterns of six CBCC reagent combinations, we achieved enhanced coverage of cysteine-labeled peptides. Implementation of labile searches afforded unique PSMs and provides a roadmap for the utility of such searches in enhancing chemoproteomic peptide coverage.
Collapse
Affiliation(s)
- Tianyang Yan
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Andrew B Palmer
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Lisa M Boatner
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Nikolas R Burton
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Ernest Armenta
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Keriann M Backus
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, California 90095, United States
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, California 90095, United States
| |
Collapse
|
31
|
Polasky DA, Geiszler DJ, Yu F, Nesvizhskii AI. Multi-attribute Glycan Identification and FDR Control for Glycoproteomics. Mol Cell Proteomics 2022; 21:100205. [PMID: 35091091 PMCID: PMC8933705 DOI: 10.1016/j.mcpro.2022.100205] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 01/10/2022] [Accepted: 01/20/2022] [Indexed: 11/18/2022] Open
Abstract
Rapidly improving methods for glycoproteomics have enabled increasingly large-scale analyses of complex glycopeptide samples, but annotating the resulting mass spectrometry data with high confidence remains a major bottleneck. We recently introduced a fast and sensitive glycoproteomics search method in our MSFragger search engine, which reports glycopeptides as a combination of a peptide sequence and the mass of the attached glycan. In samples with complex glycosylation patterns, converting this mass to a specific glycan composition is not straightforward; however, as many glycans have similar or identical masses. Here, we have developed a new method for determining the glycan composition of N-linked glycopeptides fragmented by collisional or hybrid activation that uses multiple sources of information from the spectrum, including observed glycan B-type (oxonium) and Y-type ions and mass and precursor monoisotopic selection errors to discriminate between possible glycan candidates. Combined with false discovery rate estimation for the glycan assignment, we show that this method is capable of specifically and sensitively identifying glycans in complex glycopeptide analyses and effectively controls the rate of false glycan assignments. The new method has been incorporated into the PTM-Shepherd modification analysis tool to work directly with the MSFragger glyco search in the FragPipe graphical user interface, providing a complete computational pipeline for annotation of N-glycopeptide spectra with false discovery rate control of both peptide and glycan components that is both sensitive and robust against false identifications. Identifying the glycan on intact glycopeptides remains difficult in glycoproteomics. We developed a method to assign glycan compositions in N-glycoproteomics searches. We demonstrate well-controlled glycan FDR in multiple sample types. The method annotates more glycopeptide spectra than competing tools. The method is included PTM-Shepherd for a full glycoproteomics workflow in FragPipe.
Collapse
Affiliation(s)
- Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Daniel J Geiszler
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.
| |
Collapse
|
32
|
Yi X, Liao Y, Wen B, Li K, Dou Y, Savage SR, Zhang B. caAtlas: An immunopeptidome atlas of human cancer. iScience 2021; 24:103107. [PMID: 34622160 PMCID: PMC8479791 DOI: 10.1016/j.isci.2021.103107] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 08/10/2021] [Accepted: 09/03/2021] [Indexed: 01/24/2023] Open
Abstract
Comprehensive characterization of tumor antigens is essential for the design of cancer immunotherapies, and mass spectrometry (MS)-based immunopeptidomics enables high-throughput identification of major histocompatibility complex (MHC)-bound peptide antigens in vivo. Here we construct an immunopeptidome atlas of human cancer through an extensive collection of 43 published immunopeptidomic datasets and standardized analysis of 81.6 million MS/MS spectra using an open search engine. Our analysis greatly expands the current knowledge of MHC-bound antigens, including an unprecedented characterization of post-translationally modified antigens and their cancer-association. We also perform systematic analysis of cancer-testis antigens, cancer-associated antigens, and neoantigens. We make all these data together with annotated MS/MS spectra supporting identification of each antigen in an easily browsable web portal named cancer antigen atlas (caAtlas). caAtlas provides a central resource for the selection and prioritization of MHC-bound peptides for in vitro HLA binding assay and immunogenicity testing, which will pave the way to eventual development of cancer immunotherapies. Extensive collection of 43 immunopeptidomic datasets with 1018 samples Standardized and rigorous identification of HLA-bound peptides, including PTM peptides Comprehensive annotation of CT antigens and cancer-associated antigens User-friendly data dissemination through the caAtlas web portal
Collapse
Affiliation(s)
- Xinpei Yi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
33
|
Nomura Y, Dohmae N. Discovery of a small protein-encoding cis-regulatory overlapping gene of the tumor suppressor gene Scribble in humans. Commun Biol 2021; 4:1098. [PMID: 34535749 PMCID: PMC8448870 DOI: 10.1038/s42003-021-02619-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 08/30/2021] [Indexed: 12/26/2022] Open
Abstract
Intensive gene annotation has revealed many functional and regulatory elements in the human genome. Although eukaryotic protein-coding genes are generally transcribed into monocistronic mRNAs, recent studies have discovered additional short open reading frames (sORFs) in mRNAs. Here, we performed proteogenomic data mining for hidden proteins categorized into sORF-encoded polypeptides (SEPs) in human cancers. We identified a new SEP-encoding overlapping sORF (oORF) on the cell polarity determinant Scribble (SCRIB) that is considered a proto-oncogene with tumor suppressor function in Hippo-YAP/TAZ, MAPK/ERK, and PI3K/Akt/mTOR signaling. Reanalysis of clinical human proteomic data revealed translational dysregulation of both SCRIB and its oORF, oSCRIB, during carcinogenesis. Biochemical analyses suggested that the translatable oSCRIB constitutively limits the capacity of eukaryotic ribosomes to translate the downstream SCRIB. These findings provide a new example of cis-regulatory oORFs that function as a ribosomal roadblock and potentially serve as a fail-safe mechanism to normal cells for non-excessive downstream gene expression, which is hijacked in cancer. Yuhta Nomura and Naoshi Dohmae report the discovery of a small protein-coding gene that overlaps the tumor suppressor gene Scribble. Their data suggest that the overlapping gene, oSCRIB, limits the translation of downstream Scribble and may have important implications in cancer.
Collapse
Affiliation(s)
- Yuhta Nomura
- Biomolecular Characterization Unit, RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
| | - Naoshi Dohmae
- Biomolecular Characterization Unit, RIKEN Center for Sustainable Resource Science, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
| |
Collapse
|
34
|
Atkinson SC, Heaton SM, Audsley MD, Kleifeld O, Borg NA. TRIM25 and DEAD-Box RNA Helicase DDX3X Cooperate to Regulate RIG-I-Mediated Antiviral Immunity. Int J Mol Sci 2021; 22:9094. [PMID: 34445801 PMCID: PMC8396550 DOI: 10.3390/ijms22169094] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 08/18/2021] [Accepted: 08/18/2021] [Indexed: 12/25/2022] Open
Abstract
The cytoplasmic retinoic acid-inducible gene-I (RIG-I)-like receptors (RLRs) initiate interferon (IFN) production and antiviral gene expression in response to RNA virus infection. Consequently, RLR signalling is tightly regulated by both host and viral factors. Tripartite motif protein 25 (TRIM25) is an E3 ligase that ubiquitinates multiple substrates within the RLR signalling cascade, playing both ubiquitination-dependent and -independent roles in RIG-I-mediated IFN induction. However, additional regulatory roles are emerging. Here, we show a novel interaction between TRIM25 and another protein in the RLR pathway that is essential for type I IFN induction, DEAD-box helicase 3X (DDX3X). In vitro assays and knockdown studies reveal that TRIM25 ubiquitinates DDX3X at lysine 55 (K55) and that TRIM25 and DDX3X cooperatively enhance IFNB1 induction following RIG-I activation, but the latter is independent of TRIM25's catalytic activity. Furthermore, we found that the influenza A virus non-structural protein 1 (NS1) disrupts the TRIM25:DDX3X interaction, abrogating both TRIM25-mediated ubiquitination of DDX3X and cooperative activation of the IFNB1 promoter. Thus, our results reveal a new interplay between two RLR-host proteins that cooperatively enhance IFN-β production. We also uncover a new and further mechanism by which influenza A virus NS1 suppresses host antiviral defence.
Collapse
Affiliation(s)
- Sarah C. Atkinson
- Immunity and Immune Evasion Laboratory, Chronic Infectious and Inflammatory Diseases Research, School of Health and Biomedical Sciences, RMIT University, Bundoora, VIC 3083, Australia; (S.C.A.); (M.D.A.)
- Infection & Immunity Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia;
| | - Steven M. Heaton
- Infection & Immunity Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia;
| | - Michelle D. Audsley
- Immunity and Immune Evasion Laboratory, Chronic Infectious and Inflammatory Diseases Research, School of Health and Biomedical Sciences, RMIT University, Bundoora, VIC 3083, Australia; (S.C.A.); (M.D.A.)
- Infection & Immunity Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia;
| | - Oded Kleifeld
- Faculty of Biology, Technion-Israel Institute of Technology, Haifa 32000, Israel;
| | - Natalie A. Borg
- Immunity and Immune Evasion Laboratory, Chronic Infectious and Inflammatory Diseases Research, School of Health and Biomedical Sciences, RMIT University, Bundoora, VIC 3083, Australia; (S.C.A.); (M.D.A.)
- Infection & Immunity Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia;
| |
Collapse
|
35
|
Satpathy S, Krug K, Jean Beltran PM, Savage SR, Petralia F, Kumar-Sinha C, Dou Y, Reva B, Kane MH, Avanessian SC, Vasaikar SV, Krek A, Lei JT, Jaehnig EJ, Omelchenko T, Geffen Y, Bergstrom EJ, Stathias V, Christianson KE, Heiman DI, Cieslik MP, Cao S, Song X, Ji J, Liu W, Li K, Wen B, Li Y, Gümüş ZH, Selvan ME, Soundararajan R, Visal TH, Raso MG, Parra ER, Babur Ö, Vats P, Anand S, Schraink T, Cornwell M, Rodrigues FM, Zhu H, Mo CK, Zhang Y, da Veiga Leprevost F, Huang C, Chinnaiyan AM, Wyczalkowski MA, Omenn GS, Newton CJ, Schurer S, Ruggles KV, Fenyö D, Jewell SD, Thiagarajan M, Mesri M, Rodriguez H, Mani SA, Udeshi ND, Getz G, Suh J, Li QK, Hostetter G, Paik PK, Dhanasekaran SM, Govindan R, Ding L, Robles AI, Clauser KR, Nesvizhskii AI, Wang P, Carr SA, Zhang B, Mani DR, Gillette MA. A proteogenomic portrait of lung squamous cell carcinoma. Cell 2021; 184:4348-4371.e40. [PMID: 34358469 PMCID: PMC8475722 DOI: 10.1016/j.cell.2021.07.016] [Citation(s) in RCA: 151] [Impact Index Per Article: 50.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 04/26/2021] [Accepted: 07/12/2021] [Indexed: 02/07/2023]
Abstract
Lung squamous cell carcinoma (LSCC) remains a leading cause of cancer death with few therapeutic options. We characterized the proteogenomic landscape of LSCC, providing a deeper exposition of LSCC biology with potential therapeutic implications. We identify NSD3 as an alternative driver in FGFR1-amplified tumors and low-p63 tumors overexpressing the therapeutic target survivin. SOX2 is considered undruggable, but our analyses provide rationale for exploring chromatin modifiers such as LSD1 and EZH2 to target SOX2-overexpressing tumors. Our data support complex regulation of metabolic pathways by crosstalk between post-translational modifications including ubiquitylation. Numerous immune-related proteogenomic observations suggest directions for further investigation. Proteogenomic dissection of CDKN2A mutations argue for more nuanced assessment of RB1 protein expression and phosphorylation before declaring CDK4/6 inhibition unsuccessful. Finally, triangulation between LSCC, LUAD, and HNSCC identified both unique and common therapeutic vulnerabilities. These observations and proteogenomics data resources may guide research into the biology and treatment of LSCC.
Collapse
Affiliation(s)
- Shankha Satpathy
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA.
| | - Karsten Krug
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Pierre M Jean Beltran
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Francesca Petralia
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | | | - Yongchao Dou
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Boris Reva
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - M Harry Kane
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Shayan C Avanessian
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Suhas V Vasaikar
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Azra Krek
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jonathan T Lei
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Yifat Geffen
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Erik J Bergstrom
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Vasileios Stathias
- Sylvester Comprehensive Cancer Center and Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, FL 33136, USA
| | - Karen E Christianson
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - David I Heiman
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Marcin P Cieslik
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Song Cao
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Xiaoyu Song
- Department of Population Health Science and Policy, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jiayi Ji
- Department of Population Health Science and Policy, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Wenke Liu
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yize Li
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Zeynep H Gümüş
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Myvizhi Esai Selvan
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Rama Soundararajan
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Tanvi H Visal
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Maria G Raso
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Edwin Roger Parra
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Özgün Babur
- Computer Science Department, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Pankaj Vats
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Shankara Anand
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Tobias Schraink
- Institute for Systems Genetics and Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - MacIntosh Cornwell
- Institute for Systems Genetics and Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | | | - Houxiang Zhu
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Chia-Kuei Mo
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Yuping Zhang
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Chen Huang
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Arul M Chinnaiyan
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Stephan Schurer
- Sylvester Comprehensive Cancer Center and Department of Molecular and Cellular Pharmacology, Miller School of Medicine, University of Miami, Miami, FL 33136, USA
| | - Kelly V Ruggles
- Institute for Systems Genetics and Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - David Fenyö
- Institute for Systems Genetics and Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Scott D Jewell
- Van Andel Research Institute, Grand Rapids, MI 49503, USA
| | - Mathangi Thiagarajan
- Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Mehdi Mesri
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Sendurai A Mani
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Namrata D Udeshi
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Gad Getz
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - James Suh
- Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Qing Kay Li
- Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins Medical Institutions, Baltimore, MD 21224, USA
| | | | - Paul K Paik
- Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | | | - Ramaswamy Govindan
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Li Ding
- Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO 63110, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Karl R Clauser
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Pei Wang
- Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Steven A Carr
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA.
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| | - D R Mani
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA.
| | - Michael A Gillette
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA; Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital, Boston, MA 02115, USA.
| |
Collapse
|
36
|
Ebrahimi N, Akbari M, Ghanaatian M, Roozbahani Moghaddam P, Adelian S, Borjian Boroujeni M, Yazdani E, Ahmadi A, Hamblin MR. Development of neoantigens: from identification in cancer cells to application in cancer vaccines. Expert Rev Vaccines 2021; 21:941-955. [PMID: 34196590 DOI: 10.1080/14760584.2021.1951246] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Introduction: The discovery of neoantigens as mutated proteins specifically expressed in tumor cells but not in normal cells has led to improved cancer vaccines. Targeting neoantigens can induce anti-tumor T-cell responses to destroy tumors without damaging healthy cells. Extensive advances in genome sequencing technology and bioinformatics analysis have made it possible to discover and design effective neoantigens for use in therapeutic cancer vaccines. Neoantigens-based therapeutic personalized vaccines have shown promising results in cancer immunotherapy.Areas covered: We discuss the types of cancer neoantigens that can be recognized by the immune system in this review. We also summarize the detection, identification, and design of neoantigens and their appliction in developing cancer vaccines. Finally, clinical trials of neoantigen-based vaccines, their advantages, and their limitations are reviewed. From 2015 to 2020, the authors conducted a literature search of controlled randomized trials and laboratory investigations that that focused on neoantigens, their use in the design of various types of cancer vaccines.Expert opinion: Neoantigens are cancer cell-specific antigens, which their expression leads to the immune stimulation against tumor cells. The identification and delivery of specific neoantigens to antigen-presenting cells (APCs) with the help of anti-cancer vaccines promise novel and more effective cancer treatments.
Collapse
Affiliation(s)
- Nasim Ebrahimi
- Division of Genetics, Department Cell, and Molecular Biology & Microbiology, Faculty of Science and Technology, University of Isfahan, Isfahan, Iran
| | - Maryam Akbari
- Department of Immunology, Asthma and Allergy Research Institute, Tehran University of Medical Sciences, Tehran, Iran
| | - Masoud Ghanaatian
- Department of Microbiology, Islamic Azad University of Jahrom, Fars, Iran
| | | | - Samaneh Adelian
- Department of Genetics, Shahrekord University of Medical Sciences, Shahrekord, Iran
| | | | - Elnaz Yazdani
- Department of Biology, Faculty of Science, University Of Isfahan, Isfahan, Iran
| | - Amirhossein Ahmadi
- Department of Biological Science and Technology, Faculty of Nano and Bio Science and Technology, Persian Gulf University, Bushehr, Iran
| | - Michael R Hamblin
- Laser Research Centre, Faculty of Health Science, University of Johannesburg, Doornfontein, South Africa
| |
Collapse
|
37
|
Yang X, Neta P, Mirokhin YA, Tchekhovskoi DV, Remoroza CA, Burke MC, Liang Y, Markey SP, Stein SE. MS_Piano: A Software Tool for Annotating Peaks in CID Tandem Mass Spectra of Peptides and N-Glycopeptides. J Proteome Res 2021; 20:4603-4609. [PMID: 34264676 DOI: 10.1021/acs.jproteome.1c00324] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Annotating product ion peaks in tandem mass spectra is essential for evaluating spectral quality and validating peptide identification. This task is more complex for glycopeptides and is crucial for the confident determination of glycosylation sites in glycoproteins. MS_Piano (Mass Spectrum Peptide Annotation) software was developed for reliable annotation of peaks in collision induced dissociation (CID) tandem mass spectra of peptides or N-glycopeptides for given peptide sequences, charge states, and optional modifications. The program annotates each peak in high or low resolution spectra with possible product ion(s) and the mass difference between the measured and theoretical m/z values. Spectral quality is measured by two major parameters: the ratio between the sum of unannotated vs all peak intensities in the top 20 peaks, and the intensity of the highest unannotated peak. The product ions of peptides, glycans, and glycopeptides in spectra are labeled in different class-type colors to facilitate interpretation. MS_Piano assists validating peptide and N-glycopeptide identification from database and library searches and provides quality control and optimizes search reliability in custom developed peptide mass spectral libraries. The software is freely available in .exe and .dll formats for the Windows operating system.
Collapse
Affiliation(s)
- Xiaoyu Yang
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Pedatsur Neta
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Yuri A Mirokhin
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Dmitrii V Tchekhovskoi
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Concepcion A Remoroza
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Meghan C Burke
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Yuxue Liang
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Sanford P Markey
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| | - Stephen E Stein
- Mass Spectrometry Data Center, National Institute of Standards and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United States
| |
Collapse
|
38
|
Choi IK, Jiang T, Kankara SR, Wu S, Liu X. TopMSV: A Web-Based Tool for Top-Down Mass Spectrometry Data Visualization. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:1312-1318. [PMID: 33780241 PMCID: PMC8172439 DOI: 10.1021/jasms.0c00460] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Top-down mass spectrometry (MS) investigates intact proteoforms for proteoform identification, characterization, and quantification. Data visualization plays an essential role in top-down MS data analysis because proteoform identification and characterization often involve manual data inspection to determine the molecular masses of highly charged ions and validate unexpected alterations in identified proteoforms. While many software tools have been developed for MS data visualization, there is still a lack of web-based visualization software designed for top-down MS. Here, we present TopMSV, a web-based tool for top-down MS data processing and visualization. TopMSV provides interactive views of top-down MS data using a web browser. It integrates software tools for spectral deconvolution and proteoform identification and uses analysis results of the tools to annotate top-down MS data.
Collapse
Affiliation(s)
- In Kwon Choi
- Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
| | - Tianze Jiang
- Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
| | - Sreekanth Reddy Kankara
- Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
| | - Si Wu
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma 73019, United States
| | - Xiaowen Liu
- Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana 46202, United States
| |
Collapse
|
39
|
Rajczewski AT, Mehta S, Nguyen DDA, Grüning B, Johnson JE, McGowan T, Griffin TJ, Jagtap PD. A rigorous evaluation of optimal peptide targets for MS-based clinical diagnostics of Coronavirus Disease 2019 (COVID-19). Clin Proteomics 2021; 18:15. [PMID: 33971807 PMCID: PMC8107781 DOI: 10.1186/s12014-021-09321-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 05/01/2021] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The Coronavirus Disease 2019 (COVID-19) global pandemic has had a profound, lasting impact on the world's population. A key aspect to providing care for those with COVID-19 and checking its further spread is early and accurate diagnosis of infection, which has been generally done via methods for amplifying and detecting viral RNA molecules. Detection and quantitation of peptides using targeted mass spectrometry-based strategies has been proposed as an alternative diagnostic tool due to direct detection of molecular indicators from non-invasively collected samples as well as the potential for high-throughput analysis in a clinical setting; many studies have revealed the presence of viral peptides within easily accessed patient samples. However, evidence suggests that some viral peptides could serve as better indicators of COVID-19 infection status than others, due to potential misidentification of peptides derived from human host proteins, poor spectral quality, high limits of detection etc. METHODS: In this study we have compiled a list of 636 peptides identified from Sudden Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) samples, including from in vitro and clinical sources. These datasets were rigorously analyzed using automated, Galaxy-based workflows containing tools such as PepQuery, BLAST-P, and the Multi-omic Visualization Platform as well as the open-source tools MetaTryp and Proteomics Data Viewer (PDV). RESULTS Using PepQuery for confirming peptide spectrum matches, we were able to narrow down the 639-peptide possibilities to 87 peptides that were most robustly detected and specific to the SARS-CoV-2 virus. The specificity of these sequences to coronavirus taxa was confirmed using Unipept and BLAST-P. Through stringent p-value cutoff combined with manual verification of peptide spectrum match quality, 4 peptides derived from the nucleocapsid phosphoprotein and membrane protein were found to be most robustly detected across all cell culture and clinical samples, including those collected non-invasively. CONCLUSION We propose that these peptides would be of the most value for clinical proteomics applications seeking to detect COVID-19 from patient samples. We also contend that samples harvested from the upper respiratory tract and oral cavity have the highest potential for diagnosis of SARS-CoV-2 infection from easily collected patient samples using mass spectrometry-based proteomics assays.
Collapse
Affiliation(s)
- Andrew T Rajczewski
- Department of Biochemistry, Molecular and Cell Biology Building, University of Minnesota, 420 Washington Ave SE 7-129, Minneapolis, MN, 55455, USA
| | - Subina Mehta
- Department of Biochemistry, Molecular and Cell Biology Building, University of Minnesota, 420 Washington Ave SE 7-129, Minneapolis, MN, 55455, USA
| | - Dinh Duy An Nguyen
- Department of Biochemistry, Molecular and Cell Biology Building, University of Minnesota, 420 Washington Ave SE 7-129, Minneapolis, MN, 55455, USA
| | - Björn Grüning
- Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Thomas McGowan
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Timothy J Griffin
- Department of Biochemistry, Molecular and Cell Biology Building, University of Minnesota, 420 Washington Ave SE 7-129, Minneapolis, MN, 55455, USA
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular and Cell Biology Building, University of Minnesota, 420 Washington Ave SE 7-129, Minneapolis, MN, 55455, USA.
| |
Collapse
|
40
|
A tri-functional amino acid enables mapping of binding sites for posttranslational-modification-mediated protein-protein interactions. Mol Cell 2021; 81:2669-2681.e9. [PMID: 33894155 DOI: 10.1016/j.molcel.2021.04.001] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Revised: 03/01/2021] [Accepted: 03/31/2021] [Indexed: 12/16/2022]
Abstract
Posttranslational modification (PTM), through the recruitment of effector proteins (i.e., "readers") that signal downstream events, plays key roles in regulating a variety of cellular processes. To understand how a PTM is recognized, it is necessary to find its readers and, importantly, the location of the binding pockets responsible for PTM recognition. Although various methods have been developed to identify PTM readers, it remains a challenge to directly map the PTM-binding regions, especially for intrinsically disordered domains. Here, we demonstrate a photo-crosslinkable, clickable, and cleavable tri-functional amino acid, ADdis-Cys, that when coupled with mass spectrometry (ADdis-Cys-MS) can not only identify PTM readers from complex proteomes but also simultaneously map their PTM-recognition modules. Using ADdis-Cys-MS, we successfully identify the binding sites of several reader-PTM interactions, among which we discover human C1QBP as a histone chaperone. This robust method should find wide applications in examining other histone or non-histone PTM-mediated protein-protein interactions.
Collapse
|
41
|
Proteome Discoverer-A Community Enhanced Data Processing Suite for Protein Informatics. Proteomes 2021; 9:proteomes9010015. [PMID: 33806881 PMCID: PMC8006021 DOI: 10.3390/proteomes9010015] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 03/18/2021] [Accepted: 03/20/2021] [Indexed: 01/01/2023] Open
Abstract
Proteomics researchers today face an interesting challenge: how to choose among the dozens of data processing and analysis pipelines available for converting tandem mass spectrometry files to protein identifications. Due to the dominance of Orbitrap technology in proteomics in recent history, many researchers have defaulted to the vendor software Proteome Discoverer. Over the fourteen years since the initial release of the software, it has evolved in parallel with the increasingly complex demands faced by proteomics researchers. Today, Proteome Discoverer exists in two distinct forms with both powerful commercial versions and fully functional free versions in use in many labs today. Throughout the 11 main versions released to date, a central theme of the software has always been the ability to easily view and verify the spectra from which identifications are made. This ability is, even today, a key differentiator from other data analysis solutions. In this review I will attempt to summarize the history and evolution of Proteome Discoverer from its first launch to the versions in use today.
Collapse
|
42
|
Huang C, Chen L, Savage SR, Eguez RV, Dou Y, Li Y, da Veiga Leprevost F, Jaehnig EJ, Lei JT, Wen B, Schnaubelt M, Krug K, Song X, Cieślik M, Chang HY, Wyczalkowski MA, Li K, Colaprico A, Li QK, Clark DJ, Hu Y, Cao L, Pan J, Wang Y, Cho KC, Shi Z, Liao Y, Jiang W, Anurag M, Ji J, Yoo S, Zhou DC, Liang WW, Wendl M, Vats P, Carr SA, Mani DR, Zhang Z, Qian J, Chen XS, Pico AR, Wang P, Chinnaiyan AM, Ketchum KA, Kinsinger CR, Robles AI, An E, Hiltke T, Mesri M, Thiagarajan M, Weaver AM, Sikora AG, Lubiński J, Wierzbicka M, Wiznerowicz M, Satpathy S, Gillette MA, Miles G, Ellis MJ, Omenn GS, Rodriguez H, Boja ES, Dhanasekaran SM, Ding L, Nesvizhskii AI, El-Naggar AK, Chan DW, Zhang H, Zhang B. Proteogenomic insights into the biology and treatment of HPV-negative head and neck squamous cell carcinoma. Cancer Cell 2021; 39:361-379.e16. [PMID: 33417831 PMCID: PMC7946781 DOI: 10.1016/j.ccell.2020.12.007] [Citation(s) in RCA: 162] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 09/13/2020] [Accepted: 12/07/2020] [Indexed: 02/08/2023]
Abstract
We present a proteogenomic study of 108 human papilloma virus (HPV)-negative head and neck squamous cell carcinomas (HNSCCs). Proteomic analysis systematically catalogs HNSCC-associated proteins and phosphosites, prioritizes copy number drivers, and highlights an oncogenic role for RNA processing genes. Proteomic investigation of mutual exclusivity between FAT1 truncating mutations and 11q13.3 amplifications reveals dysregulated actin dynamics as a common functional consequence. Phosphoproteomics characterizes two modes of EGFR activation, suggesting a new strategy to stratify HNSCCs based on EGFR ligand abundance for effective treatment with inhibitory EGFR monoclonal antibodies. Widespread deletion of immune modulatory genes accounts for low immune infiltration in immune-cold tumors, whereas concordant upregulation of multiple immune checkpoint proteins may underlie resistance to anti-programmed cell death protein 1 monotherapy in immune-hot tumors. Multi-omic analysis identifies three molecular subtypes with high potential for treatment with CDK inhibitors, anti-EGFR antibody therapy, and immunotherapy, respectively. Altogether, proteogenomics provides a systematic framework to inform HNSCC biology and treatment.
Collapse
Affiliation(s)
- Chen Huang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Lijun Chen
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Rodrigo Vargas Eguez
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yize Li
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | | | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jonathan T Lei
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Michael Schnaubelt
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Karsten Krug
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Xiaoyu Song
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Marcin Cieślik
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hui-Yin Chang
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Matthew A Wyczalkowski
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Antonio Colaprico
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Division of Biostatistics, Department of Public Health Science, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Qing Kay Li
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - David J Clark
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Yingwei Hu
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Liwei Cao
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Jianbo Pan
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA; Department of Ophthalmology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Yuefan Wang
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Kyung-Cho Cho
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Zhiao Shi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yuxing Liao
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Wen Jiang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Meenakshi Anurag
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jiayi Ji
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Seungyeul Yoo
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Daniel Cui Zhou
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Wen-Wei Liang
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Michael Wendl
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Pankaj Vats
- Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Steven A Carr
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - D R Mani
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Zhen Zhang
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Jiang Qian
- Department of Ophthalmology, Johns Hopkins University, Baltimore, MD 21231, USA
| | - Xi S Chen
- Sylvester Comprehensive Cancer Center, University of Miami Miller School of Medicine, Miami, FL 33136, USA; Division of Biostatistics, Department of Public Health Science, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Alexander R Pico
- Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, USA
| | - Pei Wang
- Department of Genetics and Genomic Sciences and Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Arul M Chinnaiyan
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Christopher R Kinsinger
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ana I Robles
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Eunkyung An
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Tara Hiltke
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Mehdi Mesri
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Mathangi Thiagarajan
- Leidos Biomedical Research Inc., Frederick NaVonal Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Alissa M Weaver
- Department of Cell and Developmental Biology, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Andrew G Sikora
- Department of Head and Neck Surgery, University of Texas M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Jan Lubiński
- Department of Genetics and Pathology, International Hereditary Cancer Center, Pomeranian Medical University, 71-252 Szczecin, Poland; International Institute for Molecular Oncology, 60-203 Poznań, Poland
| | - Małgorzata Wierzbicka
- Poznań University of Medical Sciences, 61-701 Poznań, Poland; Institute of Human Genetics Polish Academy of Sciences, 60-479 Poznań, Poland
| | - Maciej Wiznerowicz
- International Institute for Molecular Oncology, 60-203 Poznań, Poland; Poznań University of Medical Sciences, 61-701 Poznań, Poland
| | - Shankha Satpathy
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA
| | - Michael A Gillette
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142, USA; Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - George Miles
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Matthew J Ellis
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Emily S Boja
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Saravana M Dhanasekaran
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Li Ding
- Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA; McDonnell Genome Institute, Washington University in St. Louis, St. Louis, MO 63108, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Adel K El-Naggar
- Department of Pathology, Division of Pathology and Laboratory Medicine, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Daniel W Chan
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA.
| | - Hui Zhang
- Department of Pathology and Oncology, Johns Hopkins University, Baltimore, MD 21231, USA.
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| | | |
Collapse
|
43
|
Rajczewski AT, Mehta S, Nguyen DDA, Grüning BA, Johnson JE, McGowan T, Griffin TJ, Jagtap PD. A rigorous evaluation of optimal peptide targets for MS-based clinical diagnostics of Coronavirus Disease 2019 (COVID-19). MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2021.02.09.21251427. [PMID: 33688669 PMCID: PMC7941646 DOI: 10.1101/2021.02.09.21251427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The Coronavirus Disease 2019 (COVID-19) global pandemic has had a profound, lasting impact on the world's population. A key aspect to providing care for those with COVID-19 and checking its further spread is early and accurate diagnosis of infection, which has been generally done via methods for amplifying and detecting viral RNA molecules. Detection and quantitation of peptides using targeted mass spectrometry-based strategies has been proposed as an alternative diagnostic tool due to direct detection of molecular indicators from non-invasively collected samples as well as the potential for high-throughput analysis in a clinical setting; many studies have revealed the presence of viral peptides within easily accessed patient samples. However, evidence suggests that some viral peptides could serve as better indicators of COVID-19 infection status than others, due to potential misidentification of peptides derived from human host proteins, poor spectral quality, high limits of detection etc. In this study we have compiled a list of 639 peptides identified from Sudden Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) samples, including from in vitro and clinical sources. These datasets were rigorously analyzed using automated, Galaxy-based workflows containing tools such as PepQuery, BLAST-P, and the Multi-omic Visualization Platform as well as the open-source tools MetaTryp and Proteomics Data Viewer (PDV). Using PepQuery for confirming peptide spectrum matches, we were able to narrow down the 639 peptide possibilities to 87 peptides which were most robustly detected and specific to the SARS-CoV-2 virus. The specificity of these sequences to coronavirus taxa was confirmed using Unipept and BLAST-P. Applying stringent statistical scoring thresholds, combined with manual verification of peptide spectrum match quality, 4 peptides derived from the nucleocapsid phosphoprotein and membrane protein were found to be most robustly detected across all cell culture and clinical samples, including those collected non-invasively. We propose that these peptides would be of the most value for clinical proteomics applications seeking to detect COVID-19 from a variety of sample types. We also contend that samples taken from the upper respiratory tract and oral cavity have the highest potential for diagnosis of SARS-CoV-2 infection from easily collected patient samples using mass spectrometry-based proteomics assays.
Collapse
Affiliation(s)
- Andrew T. Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Dinh Duy An Nguyen
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Björn A. Grüning
- Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - James E. Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA
| | - Thomas McGowan
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
44
|
The first Conus genome assembly reveals a primary genetic central dogma of conopeptides in C. betulinus. Cell Discov 2021; 7:11. [PMID: 33619264 PMCID: PMC7900195 DOI: 10.1038/s41421-021-00244-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 12/29/2020] [Indexed: 01/28/2023] Open
Abstract
Although there are various Conus species with publicly available transcriptome and proteome data, no genome assembly has been reported yet. Here, using Chinese tubular cone snail (C. betulinus) as a representative, we sequenced and assembled the first Conus genome with original identification of 133 genome-widely distributed conopeptide genes. After integration of our genomics, transcriptomics, and peptidomics data in the same species, we established a primary genetic central dogma of diverse conopeptides, assuming a rough number ratio of ~1:1:1:10s for the total genes: transcripts: proteins: post-translationally modified peptides. This ratio may be special for this worm-hunting Conus species, due to the high diversity of various Conus genomes and the big number ranges of conopeptide genes, transcripts, and peptides in previous reports of diverse Conus species. Only a fraction (45.9%) of the identified conotopeptide genes from our achieved genome assembly are transcribed with transcriptomic evidence, and few genes individually correspond to multiple transcripts possibly due to intraspecies or mutation-based variances. Variable peptide processing at the proteomic level, generating a big diversity of venom conopeptides with alternative cleavage sites, post-translational modifications, and N-/C-terminal truncations, may explain how the 133 genes and ~123 transcripts can generate thousands of conopeptides in the venom of individual C. betulinus. We also predicted many conopeptides with high stereostructural similarities to the putative analgesic ω-MVIIA, addiction therapy AuIB and insecticide ImI, suggesting that our current genome assembly for C. betulinus is a valuable genetic resource for high-throughput prediction and development of potential pharmaceuticals.
Collapse
|
45
|
Impact of DJ-1 and Helix 8 on the Proteome and Degradome of Neuron-Like Cells. Cells 2021; 10:cells10020404. [PMID: 33669258 PMCID: PMC7920061 DOI: 10.3390/cells10020404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 02/08/2021] [Accepted: 02/10/2021] [Indexed: 12/04/2022] Open
Abstract
DJ-1 is an abundant and ubiquitous component of cellular proteomes. DJ-1 supposedly exerts a wide variety of molecular functions, ranging from enzymatic activities as a deglycase, protease, and esterase to chaperone functions. However, a consensus perspective on its molecular function in the cellular context has not yet been reached. Structurally, the C-terminal helix 8 of DJ-1 has been proposed to constitute a propeptide whose proteolytic removal transforms a DJ-1 zymogen to an active hydrolase with potential proteolytic activity. To better understand the cell-contextual functionality of DJ-1 and the role of helix 8, we employed post-mitotically differentiated, neuron-like SH-SY5Y neuroblastoma cells with stable over-expression of full length DJ-1 or DJ-1 lacking helix 8 (ΔH8), either with a native catalytically active site (C106) or an inactive site (C106A active site mutation). Global proteome comparison of cells over-expressing DJ-1 ΔH8 with native or mutated active site cysteine indicated a strong impact on mitochondrial biology. N-terminomic profiling however did not highlight direct protease substrate candidates for DJ-1 ΔH8, but linked DJ-1 to elevated levels of activated lysosomal proteases, albeit presumably in an indirect manner. Finally, we show that DJ-1 ΔH8 loses the deglycation activity of full length DJ-1. Our study further establishes DJ-1 as deglycation enzyme. Helix 8 is essential for the deglycation activity but dispensable for the impact on lysosomal and mitochondrial biology; further illustrating the pleiotropic nature of DJ-1.
Collapse
|
46
|
Hotta T, Haynes SE, Blasius TL, Gebbie M, Eberhardt EL, Sept D, Cianfrocco M, Verhey KJ, Nesvizhskii AI, Ohi R. Parthenolide Destabilizes Microtubules by Covalently Modifying Tubulin. Curr Biol 2021; 31:900-907.e6. [PMID: 33482110 DOI: 10.1016/j.cub.2020.11.055] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 11/02/2020] [Accepted: 11/19/2020] [Indexed: 12/28/2022]
Abstract
Detyrosination of the α-tubulin C-terminal tail is a post-translational modification (PTM) of microtubules that is key for many biological processes.1 Although detyrosination is the oldest known microtubule PTM,2-7 the carboxypeptidase responsible for this modification, VASH1/2-SVBP, was identified only 3 years ago,8,9 precluding genetic approaches to prevent detyrosination. Studies examining the cellular functions of detyrosination have therefore relied on a natural product, parthenolide, which is widely believed to block detyrosination of α-tubulin in cells, presumably by inhibiting the activity of the relevant carboxypeptidase(s).10 Parthenolide is a sesquiterpene lactone that forms covalent linkages predominantly with exposed thiol groups; e.g., on cysteine residues.11-13 Using mass spectrometry, we show that parthenolide forms adducts on both cysteine and histidine residues on tubulin itself, in vitro and in cells. Parthenolide causes tubulin protein aggregation and prevents the formation of microtubules. In contrast to epoY, an epoxide inhibitor of VASH1/2-SVBP,9 parthenolide does not block VASH1-SVBP activity in vitro. Lastly, we show that epoY is an efficacious inhibitor of microtubule detyrosination in cells, providing an alternative chemical means to block detyrosination. Collectively, our work supports the notion that parthenolide is a promiscuous inhibitor of many cellular processes and suggests that its ability to block detyrosination may be an indirect consequence of reducing the polymerization-competent pool of tubulin in cells.
Collapse
Affiliation(s)
- Takashi Hotta
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Sarah E Haynes
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Teresa L Blasius
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Margo Gebbie
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Emily L Eberhardt
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - David Sept
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, USA
| | - Michael Cianfrocco
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Kristen J Verhey
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Ryoma Ohi
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
47
|
|
48
|
Polasky DA, Yu F, Teo GC, Nesvizhskii AI. Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat Methods 2020. [PMID: 33020657 DOI: 10.1101/2020.05.18.102665] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
Recent advances in methods for enrichment and mass spectrometric analysis of intact glycopeptides have produced large-scale glycoproteomics datasets, but interpreting these data remains challenging. We present MSFragger-Glyco, a glycoproteomics mode of the MSFragger search engine, for fast and sensitive identification of N- and O-linked glycopeptides and open glycan searches. Reanalysis of recent N-glycoproteomics data resulted in annotation of 80% more glycopeptide spectrum matches (glycoPSMs) than previously reported. In published O-glycoproteomics data, our method more than doubled the number of glycoPSMs annotated when searching the same glycans as the original search, and yielded 4- to 6-fold increases when expanding searches to include additional glycan compositions and other modifications. Expanded searches also revealed many sulfated and complex glycans that remained hidden to the original search. With greatly improved spectral annotation, coupled with the speed of index-based scoring, MSFragger-Glyco makes it possible to comprehensively interrogate glycoproteomics data and illuminate the many roles of glycosylation.
Collapse
Affiliation(s)
- Daniel A Polasky
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Guo Ci Teo
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA.
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
49
|
Li K, Jain A, Malovannaya A, Wen B, Zhang B. DeepRescore: Leveraging Deep Learning to Improve Peptide Identification in Immunopeptidomics. Proteomics 2020; 20:e1900334. [PMID: 32864883 PMCID: PMC7718998 DOI: 10.1002/pmic.201900334] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 08/27/2020] [Indexed: 12/23/2022]
Abstract
The identification of major histocompatibility complex (MHC)-binding peptides in mass spectrometry (MS)-based immunopeptideomics relies largely on database search engines developed for proteomics data analysis. However, because immunopeptidomics experiments do not involve enzymatic digestion at specific residues, an inflated search space leads to a high false positive rate and low sensitivity in peptide identification. In order to improve the sensitivity and reliability of peptide identification, a post-processing tool named DeepRescore is developed. DeepRescore combines peptide features derived from deep learning predictions, namely accurate retention timeand MS/MS spectra predictions, with previously used features to rescore peptide-spectrum matches. Using two public immunopeptidomics datasets, it is shown that rescoring by DeepRescore increases both the sensitivity and reliability of MHC-binding peptide and neoantigen identifications compared to existing methods. It is also shown that the performance improvement is, to a large extent, driven by the deep learning-derived features. DeepRescore is developed using NextFlow and Docker and is available at https://github.com/bzhanglab/DeepRescore.
Collapse
Affiliation(s)
- Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Antrix Jain
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX 77030, USA
| | - Anna Malovannaya
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX 77030, USA
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
50
|
Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat Methods 2020; 17:1125-1132. [PMID: 33020657 PMCID: PMC7606558 DOI: 10.1038/s41592-020-0967-9] [Citation(s) in RCA: 121] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 08/31/2020] [Indexed: 12/15/2022]
Abstract
Recent advances in methods for enrichment and mass spectrometric analysis of intact glycopeptides have produced large-scale glycoproteomics datasets, but interpreting this data remains challenging. We present MSFragger-Glyco, a glycoproteomics mode of the MSFragger search engine, for fast and sensitive identification of N- and O-linked glycopeptides and open glycan searches. Reanalysis of recent N-glycoproteomics data resulted in annotation of 80% more glycopeptide-spectrum matches (glycoPSMs) than previously reported. In published O-glycoproteomics data, our method more than doubled the number of glycoPSMs annotated when searching the same glycans as the original search and yielded 4–6-fold increases when expanding searches to include additional glycan compositions and other modifications. Expanded searches also revealed many sulfated and complex glycans that remained hidden to the original search. With greatly improved spectral annotation, coupled with the speed of index-based scoring, MSFragger-Glyco makes it possible to comprehensively interrogate glycoproteomics data and illuminate the many roles of glycosylation.
Collapse
|