1
|
Frejno M, Berger MT, Tüshaus J, Hogrebe A, Seefried F, Graber M, Samaras P, Ben Fredj S, Sukumar V, Eljagh L, Bronshtein I, Mamisashvili L, Schneider M, Gessulat S, Schmidt T, Kuster B, Zolg DP, Wilhelm M. Unifying the analysis of bottom-up proteomics data with CHIMERYS. Nat Methods 2025; 22:1017-1027. [PMID: 40263583 PMCID: PMC12074992 DOI: 10.1038/s41592-025-02663-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 03/06/2025] [Indexed: 04/24/2025]
Abstract
Proteomic workflows generate vastly complex peptide mixtures that are analyzed by liquid chromatography-tandem mass spectrometry, creating thousands of spectra, most of which are chimeric and contain fragment ions from more than one peptide. Because of differences in data acquisition strategies such as data-dependent, data-independent or parallel reaction monitoring, separate software packages employing different analysis concepts are used for peptide identification and quantification, even though the underlying information is principally the same. Here, we introduce CHIMERYS, a spectrum-centric search algorithm designed for the deconvolution of chimeric spectra that unifies proteomic data analysis. Using accurate predictions of peptide retention time, fragment ion intensities and applying regularized linear regression, it explains as much fragment ion intensity as possible with as few peptides as possible. Together with rigorous false discovery rate control, CHIMERYS accurately identifies and quantifies multiple peptides per tandem mass spectrum in data-dependent, data-independent or parallel reaction monitoring experiments.
Collapse
Affiliation(s)
| | | | - Johanna Tüshaus
- School of Life Sciences, Technical University of Munich, Freising, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | - Bernhard Kuster
- School of Life Sciences, Technical University of Munich, Freising, Germany
- Munich Data Science Institute (MDSI), Technical University of Munich, Garching b. München, Germany
| | | | - Mathias Wilhelm
- School of Life Sciences, Technical University of Munich, Freising, Germany.
- Munich Data Science Institute (MDSI), Technical University of Munich, Garching b. München, Germany.
| |
Collapse
|
2
|
Hamood F, Gabriel W, Pfeiffer P, Kuster B, Wilhelm M, The M. ProSIMSIt: The Best of Both Worlds in Data-Driven Rescoring and Identification Transfer. J Proteome Res 2025; 24:2173-2180. [PMID: 40119808 PMCID: PMC11976853 DOI: 10.1021/acs.jproteome.4c00967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Revised: 02/19/2025] [Accepted: 03/10/2025] [Indexed: 03/24/2025]
Abstract
Multibatch isobaric labeling experiments are frequently applied for clinical and pharmaceutical studies of large sample cohorts. To tackle the critical issue of missing values in such studies, we introduce the ProSIMSIt pipeline. It combines the advantages of tandem mass spectrum clustering via SIMSI-Transfer and data-driven rescoring via Prosit and Oktoberfest. We demonstrate that these two tools are complementary and mutually beneficial. On large-scale cancer cohort data, ProSIMSIt increased the number of peptide spectrum matches (PSMs) by 40% on both global and phosphoproteome data sets. Furthermore, on data from proteome-wide drug-response profiling of post-translational modifications (decryptM), our pipeline substantially increased drug-PTM relations and revealed previously unseen downstream effects of drug target inhibition. ProSIMSIt is available as an open-source Python package with a simple command line interface that allows easy application to MaxQuant result files.
Collapse
Affiliation(s)
- Firas Hamood
- Chair
of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Wassim Gabriel
- Assistant
Professorship of Computational Mass Spectrometry, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Pia Pfeiffer
- Assistant
Professorship of Computational Mass Spectrometry, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Bernhard Kuster
- Chair
of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
- Munich
Data Science Institute (MDSI), Technical
University of Munich, 85748 Garching, Germany
| | - Mathias Wilhelm
- Assistant
Professorship of Computational Mass Spectrometry, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
- Munich
Data Science Institute (MDSI), Technical
University of Munich, 85748 Garching, Germany
| | - Matthew The
- Chair
of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| |
Collapse
|
3
|
Castaño JD, Beaudry F. Comparative Analysis of Data-Driven Rescoring Platforms for Improved Peptide Identification in HeLa Digest Samples. Proteomics 2025; 25:e202400225. [PMID: 39895169 PMCID: PMC11962579 DOI: 10.1002/pmic.202400225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 09/16/2024] [Accepted: 01/21/2025] [Indexed: 02/04/2025]
Abstract
Mass spectrometry is a critical tool to understand complex changes in biological processes. Despite significant advances in search engine technology, many spectra remain unassigned. This research evaluates the performance of three rescoring platforms, Oktoberfest, MS2Rescore, and inSPIRE, using MaxQuant output. The results indicated a substantial increase in identifications at the peptide level (40%-53%) and PSM level (64%-67%). However, some peptides were lost due to limitations in processing posttranslational modifications (PTMs)-with up to 75% of lost peptides exhibiting PTMs. Each platform displayed distinct strengths and weaknesses. For instance, inSPIRE performed best in terms of peptide identifications and unique peptides, while MS2Rescore performed better for PSMs at higher FDR values. Differences in platform performance stemmed from different sources: original search engine feature selection, type of ion series predicted, retention time predictor, and PTMs compatibility. Overall, inSPIRE showed a superior ability to harness original search engine results. Taken all together, rescoring platforms clearly outperformed original search results; however, they demanded additional computation time (up to 77%) and manual adjustments. The findings here underline the necessity of integrating rescoring platforms into current proteomics pipelines but also address some challenges in their implementation and optimization. Future integrated platforms may help enhance adoption.
Collapse
Affiliation(s)
- Jesus D. Castaño
- Département de Biomédecine Vétérinaire, Faculté de Médecine VétérinaireUniversité de MontréalSaint‐HyacintheCanada
- Centre de recherche sur le cerveau et l'apprentissage (CIRCA)Université de MontréalSaint‐HyacintheCanada
| | - Francis Beaudry
- Département de Biomédecine Vétérinaire, Faculté de Médecine VétérinaireUniversité de MontréalSaint‐HyacintheCanada
- Centre de recherche sur le cerveau et l'apprentissage (CIRCA)Université de MontréalSaint‐HyacintheCanada
| |
Collapse
|
4
|
Schneider M, Zolg DP, Samaras P, Ben Fredj S, Bold D, Guevende A, Hogrebe A, Berger MT, Graber M, Sukumar V, Mamisashvili L, Bronsthein I, Eljagh L, Gessulat S, Seefried F, Schmidt T, Frejno M. A Scalable, Web-Based Platform for Proteomics Data Processing, Result Storage and Analysis. J Proteome Res 2025; 24:1241-1249. [PMID: 39982847 PMCID: PMC11894649 DOI: 10.1021/acs.jproteome.4c00871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2024] [Revised: 12/20/2024] [Accepted: 01/23/2025] [Indexed: 02/23/2025]
Abstract
The exponential increase in proteomics data presents critical challenges for conventional processing workflows. These pipelines often consist of fragmented software packages, glued together using complex in-house scripts or error-prone manual workflows running on local hardware, which are costly to maintain and scale. The MSAID Platform offers a fully automated, managed proteomics data pipeline, consolidating formerly disjointed functions into unified, API-driven services that cover the entire process from raw data to biological insights. Backed by the cloud-native search algorithm CHIMERYS, as well as scalable cloud compute instances and data lakes, the platform facilitates efficient processing of large data sets, automation of processing via the command line, systematic result storage, analysis, and visualization. The data lake supports elastically growing storage and unified query capabilities, facilitating large-scale analyses and efficient reuse of previously processed data, such as aggregating longitudinally acquired studies. Users interact with the platform via a web interface, CLI client, or API, providing flexible, automated access. Readily available tools for accessing result data include browser-based interrogation and one-click visualizations for statistical analysis. The platform streamlines research processes, making advanced and automated proteomic workflows accessible to a broader range of scientists. The MSAID Platform is globally available via https://platform.msaid.io.
Collapse
|
5
|
Eckert S, Berner N, Kramer K, Schneider A, Müller J, Lechner S, Brajkovic S, Sakhteman A, Graetz C, Fackler J, Dudek M, Pfaffl MW, Knolle P, Wilhelm S, Kuster B. Decrypting the molecular basis of cellular drug phenotypes by dose-resolved expression proteomics. Nat Biotechnol 2025; 43:406-415. [PMID: 38714896 PMCID: PMC11919725 DOI: 10.1038/s41587-024-02218-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 03/25/2024] [Indexed: 03/20/2025]
Abstract
Proteomics is making important contributions to drug discovery, from target deconvolution to mechanism of action (MoA) elucidation and the identification of biomarkers of drug response. Here we introduce decryptE, a proteome-wide approach that measures the full dose-response characteristics of drug-induced protein expression changes that informs cellular drug MoA. Assaying 144 clinical drugs and research compounds against 8,000 proteins resulted in more than 1 million dose-response curves that can be interactively explored online in ProteomicsDB and a custom-built Shiny App. Analysis of the collective data provided molecular explanations for known phenotypic drug effects and uncovered new aspects of the MoA of human medicines. We found that histone deacetylase inhibitors potently and strongly down-regulated the T cell receptor complex resulting in impaired human T cell activation in vitro and ex vivo. This offers a rational explanation for the efficacy of histone deacetylase inhibitors in certain lymphomas and autoimmune diseases and explains their poor performance in treating solid tumors.
Collapse
Affiliation(s)
- Stephan Eckert
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
- German Cancer Consortium (DKTK), partner site Munich, a partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Nicola Berner
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
- German Cancer Consortium (DKTK), partner site Munich, a partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Karl Kramer
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Annika Schneider
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Julian Müller
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Severin Lechner
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Sarah Brajkovic
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Amirhossein Sakhteman
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Christian Graetz
- Chair of Animal Physiology and Immunology, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Jonas Fackler
- Institute of Molecular Immunology and Experimental Oncology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Michael Dudek
- Institute of Molecular Immunology and Experimental Oncology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Michael W Pfaffl
- Chair of Animal Physiology and Immunology, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Percy Knolle
- Institute of Molecular Immunology and Experimental Oncology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Stephanie Wilhelm
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany.
- German Cancer Consortium (DKTK), partner site Munich, a partnership between DKFZ and University Center Technical University of Munich, Munich, Germany.
| |
Collapse
|
6
|
Abele M, Soleymaniniya A, Bayer FP, Lomp N, Doll E, Meng C, Neuhaus K, Scherer S, Wenning M, Wantia N, Kuster B, Wilhelm M, Ludwig C. Proteomic Diversity in Bacteria: Insights and Implications for Bacterial Identification. Mol Cell Proteomics 2025; 24:100917. [PMID: 39880082 PMCID: PMC11919601 DOI: 10.1016/j.mcpro.2025.100917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Revised: 12/20/2024] [Accepted: 01/23/2025] [Indexed: 01/31/2025] Open
Abstract
Mass spectrometry-based proteomics has revolutionized bacterial identification and elucidated many molecular mechanisms underlying bacterial growth, community formation, and drug resistance. However, most research has been focused on a few model bacteria, overlooking bacterial diversity. In this study, we present the most extensive bacterial proteomic resource to date, covering 303 species, 119 genera, and five phyla with over 636,000 unique expressed proteins, confirming the existence of over 38,700 hypothetical proteins. Accessible via the public resource ProteomicsDB, this dataset enables quantitative exploration of proteins within and across species. Additionally, we developed MS2Bac, a bacterial identification algorithm that queries NCBI's bacterial proteome space in two iterations. MS2Bac achieved over 99% species-level and 89% strain-level accuracy, surpassing methods like MALDI-TOF and FTIR, as demonstrated with food-derived bacterial isolates. MS2Bac also effectively identified bacteria in clinical samples, highlighting the potential of MS-based proteomics as a routine diagnostic tool.
Collapse
Affiliation(s)
- Miriam Abele
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Armin Soleymaniniya
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Florian P Bayer
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Nina Lomp
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Etienne Doll
- Research Department Molecular Life Sciences, TUM School of Life Sciences, Freising, Germany
| | - Chen Meng
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Klaus Neuhaus
- Core Facility Microbiome, ZIEL Institute for Food & Health, Technical University of Munich, Freising, Germany
| | - Siegfried Scherer
- Research Department Molecular Life Sciences, TUM School of Life Sciences, Freising, Germany
| | - Mareike Wenning
- Bavarian Health and Food Safety Authority, Unit for Food Microbiology and Hygiene, Oberschleißheim, Germany
| | - Nina Wantia
- Institut für Medizinische Mikrobiologie, Immunologie und Hygiene, TUM School of Medicine and Health Department Preclinical Medicine, Technical University of Munich, Munich, Germany
| | - Bernhard Kuster
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Munich Data Science Institute (MDSI), Technical University of Munich, Garching, Germany
| | - Mathias Wilhelm
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Munich Data Science Institute (MDSI), Technical University of Munich, Garching, Germany
| | - Christina Ludwig
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany.
| |
Collapse
|
7
|
Guo T, Steen JA, Mann M. Mass-spectrometry-based proteomics: from single cells to clinical applications. Nature 2025; 638:901-911. [PMID: 40011722 DOI: 10.1038/s41586-025-08584-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2024] [Accepted: 01/02/2025] [Indexed: 02/28/2025]
Abstract
Mass-spectrometry (MS)-based proteomics has evolved into a powerful tool for comprehensively analysing biological systems. Recent technological advances have markedly increased sensitivity, enabling single-cell proteomics and spatial profiling of tissues. Simultaneously, improvements in throughput and robustness are facilitating clinical applications. In this Review, we present the latest developments in proteomics technology, including novel sample-preparation methods, advanced instrumentation and innovative data-acquisition strategies. We explore how these advances drive progress in key areas such as protein-protein interactions, post-translational modifications and structural proteomics. Integrating artificial intelligence into the proteomics workflow accelerates data analysis and biological interpretation. We discuss the application of proteomics to single-cell analysis and spatial profiling, which can provide unprecedented insights into cellular heterogeneity and tissue architecture. Finally, we examine the transition of proteomics from basic research to clinical practice, including biomarker discovery in body fluids and the promise and challenges of implementing proteomics-based diagnostics. This Review provides a broad and high-level overview of the current state of proteomics and its potential to revolutionize our understanding of biology and transform medical practice.
Collapse
Affiliation(s)
- Tiannan Guo
- State Key Laboratory of Medical Proteomics, School of Medicine, Westlake University, Hangzhou, China.
- Westlake Center for Intelligent Proteomics, Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, China.
- Research Center for Industries of the Future, School of Life Sciences, Westlake University, Hangzhou, China.
| | - Judith A Steen
- Department of Neurology, Harvard Medical School, Boston, MA, USA.
- F.M. Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA.
| | - Matthias Mann
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
- NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
8
|
Wen B, Freestone J, Riffle M, MacCoss MJ, Noble WS, Keich U. Assessment of false discovery rate control in tandem mass spectrometry analysis using entrapment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.06.01.596967. [PMID: 38895431 PMCID: PMC11185562 DOI: 10.1101/2024.06.01.596967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
A pressing statistical challenge in the field of mass spectrometry proteomics is how to assess whether a given software tool provides accurate error control. Each software tool for searching such data uses its own internally implemented methodology for reporting and controlling the error. Many of these software tools are closed source, with incompletely documented methodology, and the strategies for validating the error are inconsistent across tools. In this work, we identify three different methods for validating false discovery rate (FDR) control in use in the field, one of which is invalid, one of which can only provide a lower bound rather than an upper bound, and one of which is valid but under-powered. The result is that the field has a very poor understanding of how well we are doing with respect to FDR control, particularly for the analysis of data-independent acquisition (DIA) data. We therefore propose a theoretical formulation of entrapment experiments that allows us to rigorously characterize the behavior of the various entrapment methods. We also propose a more powerful method for evaluating FDR control, and we employ that method, along with other existing techniques, to characterize a variety of popular search tools. We empirically validate our entrapment analysis in the fairly well-understood DDA setup before applying it in the DIA setup. We find that none of the DIA search tools consistently controls the FDR at the peptide level, and the tools struggle particularly with analysis of single cell datasets.
Collapse
Affiliation(s)
- Bo Wen
- Department of Genome Sciences, University of Washington
| | - Jack Freestone
- School of Mathematics and Statistics, University of Sydney
| | | | | | - William S. Noble
- Department of Genome Sciences, University of Washington
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| | - Uri Keich
- School of Mathematics and Statistics, University of Sydney
| |
Collapse
|
9
|
Tüshaus J, Eckert S, Schliemann M, Zhou Y, Pfeiffer P, Halves C, Fusco F, Weigel J, Hönikl L, Butenschön V, Todorova R, Rauert-Wunderlich H, The M, Rosenwald A, Heinemann V, Holch J, Steiger K, Delbridge C, Meyer B, Weichert W, Mogler C, Kuhn PH, Kuster B. Towards routine proteome profiling of FFPE tissue: insights from a 1,220-case pan-cancer study. EMBO J 2025; 44:304-329. [PMID: 39558110 PMCID: PMC11697351 DOI: 10.1038/s44318-024-00289-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 10/08/2024] [Accepted: 10/14/2024] [Indexed: 11/20/2024] Open
Abstract
Proteome profiling of formalin-fixed paraffin-embedded (FFPE) specimens has gained traction for the analysis of cancer tissue for the discovery of molecular biomarkers. However, reports so far focused on single cancer entities, comprised relatively few cases and did not assess the long-term performance of experimental workflows. In this study, we analyze 1220 tumors from six cancer entities processed over the course of three years. Key findings include the need for a new normalization method ensuring equal and reproducible sample loading for LC-MS/MS analysis across cohorts, showing that tumors can, on average, be profiled to a depth of >4000 proteins and discovering that current software fails to process such large ion mobility-based online fractionated datasets. We report the first comprehensive pan-cancer proteome expression resource for FFPE material comprising 11,000 proteins which is of immediate utility to the scientific community, and can be explored via a web resource. It enables a range of analyses including quantitative comparisons of proteins between patients and cohorts, the discovery of protein fingerprints representing the tissue of origin or proteins enriched in certain cancer entities.
Collapse
Affiliation(s)
- Johanna Tüshaus
- Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Stephan Eckert
- Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
- German Cancer Consortium (DKTK), Partner Site Munich, a Partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Marius Schliemann
- Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
- Bavarian Cancer Research Center (BZKF), Munich, Germany
| | - Yuxiang Zhou
- German Cancer Consortium (DKTK), Partner Site Munich, a Partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Pauline Pfeiffer
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Christiane Halves
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Federico Fusco
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Johannes Weigel
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Lisa Hönikl
- Department of Neurosurgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Vicki Butenschön
- Department of Neurosurgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Rumyana Todorova
- Department of Medicine III and Comprehensive Cancer Center Munich, University Hospital, Ludwig-Maximilians University Munich, Munich, Germany
| | | | - Matthew The
- Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany
| | | | - Volker Heinemann
- Department of Medicine III and Comprehensive Cancer Center Munich, University Hospital, Ludwig-Maximilians University Munich, Munich, Germany
| | - Julian Holch
- Department of Medicine III and Comprehensive Cancer Center Munich, University Hospital, Ludwig-Maximilians University Munich, Munich, Germany
| | - Katja Steiger
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Claire Delbridge
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Bernhard Meyer
- Department of Neurosurgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Wilko Weichert
- German Cancer Consortium (DKTK), Partner Site Munich, a Partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Carolin Mogler
- German Cancer Consortium (DKTK), Partner Site Munich, a Partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Peer-Hendrik Kuhn
- German Cancer Consortium (DKTK), Partner Site Munich, a Partnership between DKFZ and University Center Technical University of Munich, Munich, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Bernhard Kuster
- Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising, Germany.
- German Cancer Consortium (DKTK), Partner Site Munich, a Partnership between DKFZ and University Center Technical University of Munich, Munich, Germany.
- German Cancer Research Center (DKFZ), Heidelberg, Germany.
- Bavarian Cancer Research Center (BZKF), Munich, Germany.
| |
Collapse
|
10
|
Fröhlich K, Fahrner M, Brombacher E, Seredynska A, Maldacker M, Kreutz C, Schmidt A, Schilling O. Data-Independent Acquisition: A Milestone and Prospect in Clinical Mass Spectrometry-Based Proteomics. Mol Cell Proteomics 2024; 23:100800. [PMID: 38880244 PMCID: PMC11380018 DOI: 10.1016/j.mcpro.2024.100800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 06/08/2024] [Accepted: 06/13/2024] [Indexed: 06/18/2024] Open
Abstract
Data-independent acquisition (DIA) has revolutionized the field of mass spectrometry (MS)-based proteomics over the past few years. DIA stands out for its ability to systematically sample all peptides in a given m/z range, allowing an unbiased acquisition of proteomics data. This greatly mitigates the issue of missing values and significantly enhances quantitative accuracy, precision, and reproducibility compared to many traditional methods. This review focuses on the critical role of DIA analysis software tools, primarily focusing on their capabilities and the challenges they address in proteomic research. Advances in MS technology, such as trapped ion mobility spectrometry, or high field asymmetric waveform ion mobility spectrometry require sophisticated analysis software capable of handling the increased data complexity and exploiting the full potential of DIA. We identify and critically evaluate leading software tools in the DIA landscape, discussing their unique features, and the reliability of their quantitative and qualitative outputs. We present the biological and clinical relevance of DIA-MS and discuss crucial publications that paved the way for in-depth proteomic characterization in patient-derived specimens. Furthermore, we provide a perspective on emerging trends in clinical applications and present upcoming challenges including standardization and certification of MS-based acquisition strategies in molecular diagnostics. While we emphasize the need for continuous development of software tools to keep pace with evolving technologies, we advise researchers against uncritically accepting the results from DIA software tools. Each tool may have its own biases, and some may not be as sensitive or reliable as others. Our overarching recommendation for both researchers and clinicians is to employ multiple DIA analysis tools, utilizing orthogonal analysis approaches to enhance the robustness and reliability of their findings.
Collapse
Affiliation(s)
- Klemens Fröhlich
- Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany
| | - Eva Brombacher
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany; Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Adrianna Seredynska
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Maximilian Maldacker
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Clemens Kreutz
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany
| | - Alexander Schmidt
- Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany.
| |
Collapse
|
11
|
Lin A, See D, Fondrie WE, Keich U, Noble WS. Target-decoy false discovery rate estimation using Crema. Proteomics 2024; 24:e2300084. [PMID: 38380501 DOI: 10.1002/pmic.202300084] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 01/06/2024] [Accepted: 01/16/2024] [Indexed: 02/22/2024]
Abstract
Assigning statistical confidence estimates to discoveries produced by a tandem mass spectrometry proteomics experiment is critical to enabling principled interpretation of the results and assessing the cost/benefit ratio of experimental follow-up. The most common technique for computing such estimates is to use target-decoy competition (TDC), in which observed spectra are searched against a database of real (target) peptides and a database of shuffled or reversed (decoy) peptides. TDC procedures for estimating the false discovery rate (FDR) at a given score threshold have been developed for application at the level of spectra, peptides, or proteins. Although these techniques are relatively straightforward to implement, it is common in the literature to skip over the implementation details or even to make mistakes in how the TDC procedures are applied in practice. Here we present Crema, an open-source Python tool that implements several TDC methods of spectrum-, peptide- and protein-level FDR estimation. Crema is compatible with a variety of existing database search tools and provides a straightforward way to obtain robust FDR estimates.
Collapse
Affiliation(s)
- Andy Lin
- Chemical and Biological Signatures, Pacific Northwest National Laboratory, Seattle, Washington, USA
| | - Donavan See
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | | | - Uri Keich
- School of Mathematics and Statistics, University of Sydney, Sydney, Australia
| | - William Stafford Noble
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| |
Collapse
|
12
|
Picciani M, Gabriel W, Giurcoiu VG, Shouman O, Hamood F, Lautenbacher L, Jensen CB, Müller J, Kalhor M, Soleymaniniya A, Kuster B, The M, Wilhelm M. Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit. Proteomics 2024; 24:e2300112. [PMID: 37672792 DOI: 10.1002/pmic.202300112] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/17/2023] [Accepted: 08/18/2023] [Indexed: 09/08/2023]
Abstract
Machine learning (ML) and deep learning (DL) models for peptide property prediction such as Prosit have enabled the creation of high quality in silico reference libraries. These libraries are used in various applications, ranging from data-independent acquisition (DIA) data analysis to data-driven rescoring of search engine results. Here, we present Oktoberfest, an open source Python package of our spectral library generation and rescoring pipeline originally only available online via ProteomicsDB. Oktoberfest is largely search engine agnostic and provides access to online peptide property predictions, promoting the adoption of state-of-the-art ML/DL models in proteomics analysis pipelines. We demonstrate its ability to reproduce and even improve our results from previously published rescoring analyses on two distinct use cases. Oktoberfest is freely available on GitHub (https://github.com/wilhelm-lab/oktoberfest) and can easily be installed locally through the cross-platform PyPI Python package.
Collapse
Affiliation(s)
- Mario Picciani
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Wassim Gabriel
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Victor-George Giurcoiu
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Omar Shouman
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Firas Hamood
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Ludwig Lautenbacher
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Cecilia Bang Jensen
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Julian Müller
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mostafa Kalhor
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Armin Soleymaniniya
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Matthew The
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mathias Wilhelm
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| |
Collapse
|
13
|
Kotimoole CN, Ramya VK, Kaur P, Reiling N, Shandil RK, Narayanan S, Flo TH, Prasad TSK. Discovery of Species-Specific Proteotypic Peptides To Establish a Spectral Library Platform for Identification of Nontuberculosis Mycobacteria from Mass Spectrometry-Based Proteomics. J Proteome Res 2024; 23:1102-1117. [PMID: 38358903 DOI: 10.1021/acs.jproteome.3c00850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2024]
Abstract
Nontuberculous mycobacteria are opportunistic bacteria pulmonary and extra-pulmonary infections in humans that closely resemble Mycobacterium tuberculosis. Although genome sequencing strategies helped determine NTMs, a common assay for the detection of coinfection by multiple NTMs with M. tuberculosis in the primary attempt of diagnosis is still elusive. Such a lack of efficiency leads to delayed therapy, an inappropriate choice of drugs, drug resistance, disease complications, morbidity, and mortality. Although a high-resolution LC-MS/MS-based multiprotein panel assay can be developed due to its specificity and sensitivity, it needs a library of species-specific peptides as a platform. Toward this, we performed an analysis of proteomes of 9 NTM species with more than 20 million peptide spectrum matches gathered from 26 proteome data sets. Our metaproteomic analyses determined 48,172 species-specific proteotypic peptides across 9 NTMs. Notably, M. smegmatis (26,008), M. abscessus (12,442), M. vaccae (6487), M. fortuitum (1623), M. avium subsp. paratuberculosis (844), M. avium subsp. hominissuis (580), and M. marinum (112) displayed >100 species-specific proteotypic peptides. Finally, these peptides and corresponding spectra have been compiled into a spectral library, FASTA, and JSON formats for future reference and validation in clinical cohorts by the biomedical community for further translation.
Collapse
Affiliation(s)
- Chinmaya Narayana Kotimoole
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Vadageri Krishnamurthy Ramya
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Parvinder Kaur
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Norbert Reiling
- Microbial Interface Biology, Research Center Borstel, Leibniz Lung Center, Parkallee 22, D-23845 Borstel, Germany
- German Center for Infection Research (DZIF), Site Hamburg-Lübeck-Borstel-Riems, 23845 Borstel, Germany
| | - Radha Krishan Shandil
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Shridhar Narayanan
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Trude Helen Flo
- Centre of Molecular Inflammation Research, Department of Clinical and Molecular Medicine Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Kunnskapssenteret, Øya 424.04.035, Norway
| | | |
Collapse
|
14
|
The M, Picciani M, Jensen C, Gabriel W, Kuster B, Wilhelm M. AI-Assisted Processing Pipeline to Boost Protein Isoform Detection. Methods Mol Biol 2024; 2836:157-181. [PMID: 38995541 DOI: 10.1007/978-1-0716-4007-4_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
Proteomics, the study of proteins within biological systems, has seen remarkable advancements in recent years, with protein isoform detection emerging as one of the next major frontiers. One of the primary challenges is achieving the necessary peptide and protein coverage to confidently differentiate isoforms as a result of the protein inference problem and protein false discovery rate estimation challenge in large data. In this chapter, we describe the application of artificial intelligence-assisted peptide property prediction for database search engine rescoring by Oktoberfest, an approach that has proven effective, particularly for complex samples and extensive search spaces, which can greatly increase peptide coverage. Further, it illustrates a method for increasing isoform coverage by the PickedGroupFDR approach that is designed to excel when applied on large data. Real-world examples are provided to illustrate the utility of the tools in the context of rescoring, protein grouping, and false discovery rate estimation. By implementing these cutting-edge techniques, researchers can achieve a substantial increase in both peptide and isoform coverage, thus unlocking the potential of protein isoform detection in their studies and shedding light on their roles and functions in biological processes.
Collapse
Affiliation(s)
- Matthew The
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mario Picciani
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Cecilia Jensen
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Wassim Gabriel
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mathias Wilhelm
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany.
| |
Collapse
|
15
|
Liu L, Trendel J, Jiang G, Liu Y, Bruckmann A, Küster B, Sprunck S, Dresselhaus T, Bleckmann A. RBPome identification in egg-cell like callus of Arabidopsis. Biol Chem 2023; 404:1137-1149. [PMID: 37768858 DOI: 10.1515/hsz-2023-0195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 09/11/2023] [Indexed: 09/30/2023]
Abstract
RNA binding proteins (RBPs) have multiple and essential roles in transcriptional and posttranscriptional regulation of gene expression in all living organisms. Their biochemical identification in the proteome of a given cell or tissue requires significant protein amounts, which limits studies in rare and highly specialized cells. As a consequence, we know almost nothing about the role(s) of RBPs in reproductive processes such as egg cell development, fertilization and early embryogenesis in flowering plants. To systematically identify the RBPome of egg cells in the model plant Arabidopsis, we performed RNA interactome capture (RIC) experiments using the egg cell-like RKD2-callus and were able to identify 728 proteins associated with poly(A+)-RNA. Transcripts for 97 % of identified proteins could be verified in the egg cell transcriptome. 46 % of identified proteins can be associated with the RNA life cycle. Proteins involved in mRNA binding, RNA processing and metabolism are highly enriched. Compared with the few available RBPome datasets of vegetative plant tissues, we identified 475 egg cell-enriched RBPs, which will now serve as a resource to study RBP function(s) during egg cell development, fertilization and early embryogenesis. First candidates were already identified showing an egg cell-specific expression pattern in ovules.
Collapse
Affiliation(s)
- Liping Liu
- Cell Biology and Plant Biochemistry, University of Regensburg, D-93053 Regensburg, Germany
| | - Jakob Trendel
- Chair of Proteomics and Bioanalytics, Technical University of Munich (TUM), D-85354 Freising, Germany
| | - Guojing Jiang
- Cell Biology and Plant Biochemistry, University of Regensburg, D-93053 Regensburg, Germany
| | - Yanhui Liu
- College of Life Science, Longyan University, Longyan 364012, China
| | - Astrid Bruckmann
- Biochemistry I, University of Regensburg, D-93053 Regensburg, Germany
| | - Bernhard Küster
- Chair of Proteomics and Bioanalytics, Technical University of Munich (TUM), D-85354 Freising, Germany
| | - Stefanie Sprunck
- Cell Biology and Plant Biochemistry, University of Regensburg, D-93053 Regensburg, Germany
| | - Thomas Dresselhaus
- Cell Biology and Plant Biochemistry, University of Regensburg, D-93053 Regensburg, Germany
| | - Andrea Bleckmann
- Cell Biology and Plant Biochemistry, University of Regensburg, D-93053 Regensburg, Germany
| |
Collapse
|
16
|
Abele M, Doll E, Bayer FP, Meng C, Lomp N, Neuhaus K, Scherer S, Kuster B, Ludwig C. Unified Workflow for the Rapid and In-Depth Characterization of Bacterial Proteomes. Mol Cell Proteomics 2023; 22:100612. [PMID: 37391045 PMCID: PMC10407251 DOI: 10.1016/j.mcpro.2023.100612] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 05/18/2023] [Accepted: 06/26/2023] [Indexed: 07/02/2023] Open
Abstract
Bacteria are the most abundant and diverse organisms among the kingdoms of life. Due to this excessive variance, finding a unified, comprehensive, and safe workflow for quantitative bacterial proteomics is challenging. In this study, we have systematically evaluated and optimized sample preparation, mass spectrometric data acquisition, and data analysis strategies in bacterial proteomics. We investigated workflow performances on six representative species with highly different physiologic properties to mimic bacterial diversity. The best sample preparation strategy was a cell lysis protocol in 100% trifluoroacetic acid followed by an in-solution digest. Peptides were separated on a 30-min linear microflow liquid chromatography gradient and analyzed in data-independent acquisition mode. Data analysis was performed with DIA-NN using a predicted spectral library. Performance was evaluated according to the number of identified proteins, quantitative precision, throughput, costs, and biological safety. With this rapid workflow, over 40% of all encoded genes were detected per bacterial species. We demonstrated the general applicability of our workflow on a set of 23 taxonomically and physiologically diverse bacterial species. We could confidently identify over 45,000 proteins in the combined dataset, of which 30,000 have not been experimentally validated before. Our work thereby provides a valuable resource for the microbial scientific community. Finally, we grew Escherichia coli and Bacillus cereus in replicates under 12 different cultivation conditions to demonstrate the high-throughput suitability of the workflow. The proteomic workflow we present in this manuscript does not require any specialized equipment or commercial software and can be easily applied by other laboratories to support and accelerate the proteomic exploration of the bacterial kingdom.
Collapse
Affiliation(s)
- Miriam Abele
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Division of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Etienne Doll
- Division of Microbial Ecology, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Florian P Bayer
- Division of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Chen Meng
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Nina Lomp
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Klaus Neuhaus
- Division of Microbial Ecology, TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Core Facility Microbiome, ZIEL - Institute for Food & Health, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Siegfried Scherer
- Division of Microbial Ecology, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Division of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Christina Ludwig
- Bavarian Center for Biomolecular Mass Spectrometry (BayBioMS), TUM School of Life Sciences, Technical University of Munich, Freising, Germany.
| |
Collapse
|
17
|
Phlairaharn T, Ye Z, Krismer E, Pedersen AK, Pietzner M, Olsen JV, Schoof EM, Searle BC. Optimizing Linear Ion-Trap Data-Independent Acquisition toward Single-Cell Proteomics. Anal Chem 2023; 95:9881-9891. [PMID: 37338819 DOI: 10.1021/acs.analchem.3c00842] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023]
Abstract
A linear ion trap (LIT) is an affordable, robust mass spectrometer that provides fast scanning speed and high sensitivity, where its primary disadvantage is inferior mass accuracy compared to more commonly used time-of-flight or orbitrap (OT) mass analyzers. Previous efforts to utilize the LIT for low-input proteomics analysis still rely on either built-in OTs for collecting precursor data or OT-based library generation. Here, we demonstrate the potential versatility of the LIT for low-input proteomics as a stand-alone mass analyzer for all mass spectrometry (MS) measurements, including library generation. To test this approach, we first optimized LIT data acquisition methods and performed library-free searches with and without entrapment peptides to evaluate both the detection and quantification accuracy. We then generated matrix-matched calibration curves to estimate the lower limit of quantification using only 10 ng of starting material. While LIT-MS1 measurements provided poor quantitative accuracy, LIT-MS2 measurements were quantitatively accurate down to 0.5 ng on the column. Finally, we optimized a suitable strategy for spectral library generation from low-input material, which we used to analyze single-cell samples by LIT-DIA using LIT-based libraries generated from as few as 40 cells.
Collapse
Affiliation(s)
- Teeradon Phlairaharn
- The Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, København 2200, Denmark
- Department of Bioscience, TUM School of Natural Sciences, Technical University of Munich, Garching (bei München) 85748, Germany
- Computational Medicine, Berlin Institute of Health at Charité─Universitätsmedizin Berlin, Berlin 10117, Germany
| | - Zilu Ye
- The Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, København 2200, Denmark
| | - Elena Krismer
- The Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, København 2200, Denmark
| | - Anna-Kathrine Pedersen
- The Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, København 2200, Denmark
| | - Maik Pietzner
- Computational Medicine, Berlin Institute of Health at Charité─Universitätsmedizin Berlin, Berlin 10117, Germany
| | - Jesper V Olsen
- The Novo Nordisk Foundation Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, København 2200, Denmark
| | - Erwin M Schoof
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby 2800, Denmark
| | - Brian C Searle
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States
- Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
18
|
Higgins L, Gerdes H, Cutillas PR. Principles of phosphoproteomics and applications in cancer research. Biochem J 2023; 480:403-420. [PMID: 36961757 PMCID: PMC10212522 DOI: 10.1042/bcj20220220] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 02/24/2023] [Accepted: 02/28/2023] [Indexed: 03/25/2023]
Abstract
Phosphorylation constitutes the most common and best-studied regulatory post-translational modification in biological systems and archetypal signalling pathways driven by protein and lipid kinases are disrupted in essentially all cancer types. Thus, the study of the phosphoproteome stands to provide unique biological information on signalling pathway activity and on kinase network circuitry that is not captured by genetic or transcriptomic technologies. Here, we discuss the methods and tools used in phosphoproteomics and highlight how this technique has been used, and can be used in the future, for cancer research. Challenges still exist in mass spectrometry phosphoproteomics and in the software required to provide biological information from these datasets. Nevertheless, improvements in mass spectrometers with enhanced scan rates, separation capabilities and sensitivity, in biochemical methods for sample preparation and in computational pipelines are enabling an increasingly deep analysis of the phosphoproteome, where previous bottlenecks in data acquisition, processing and interpretation are being relieved. These powerful hardware and algorithmic innovations are not only providing exciting new mechanistic insights into tumour biology, from where new drug targets may be derived, but are also leading to the discovery of phosphoproteins as mediators of drug sensitivity and resistance and as classifiers of disease subtypes. These studies are, therefore, uncovering phosphoproteins as a new generation of disruptive biomarkers to improve personalised anti-cancer therapies.
Collapse
Affiliation(s)
- Luke Higgins
- Cell Signaling and Proteomics Group, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, U.K
| | - Henry Gerdes
- Cell Signaling and Proteomics Group, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, U.K
| | - Pedro R. Cutillas
- Cell Signaling and Proteomics Group, Centre for Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, U.K
- Alan Turing Institute, The British Library, London, U.K
- Digital Environment Research Institute, Queen Mary University of London, London, U.K
| |
Collapse
|
19
|
Prakash A, García-Seisdedos D, Wang S, Kundu DJ, Collins A, George N, Moreno P, Papatheodorou I, Jones AR, Vizcaíno JA. Integrated View of Baseline Protein Expression in Human Tissues. J Proteome Res 2023; 22:729-742. [PMID: 36577097 PMCID: PMC9990129 DOI: 10.1021/acs.jproteome.2c00406] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The availability of proteomics datasets in the public domain, and in the PRIDE database, in particular, has increased dramatically in recent years. This unprecedented large-scale availability of data provides an opportunity for combined analyses of datasets to get organism-wide protein abundance data in a consistent manner. We have reanalyzed 24 public proteomics datasets from healthy human individuals to assess baseline protein abundance in 31 organs. We defined tissue as a distinct functional or structural region within an organ. Overall, the aggregated dataset contains 67 healthy tissues, corresponding to 3,119 mass spectrometry runs covering 498 samples from 489 individuals. We compared protein abundances between different organs and studied the distribution of proteins across these organs. We also compared the results with data generated in analogous studies. Additionally, we performed gene ontology and pathway-enrichment analyses to identify organ-specific enriched biological processes and pathways. As a key point, we have integrated the protein abundance results into the resource Expression Atlas, where they can be accessed and visualized either individually or together with gene expression data coming from transcriptomics datasets. We believe this is a good mechanism to make proteomics data more accessible for life scientists.
Collapse
Affiliation(s)
- Ananth Prakash
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom.,Open Targets, Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - David García-Seisdedos
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - Shengbo Wang
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - Deepti Jaiswal Kundu
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - Andrew Collins
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, LiverpoolL69 7ZB, United Kingdom
| | - Nancy George
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - Pablo Moreno
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - Irene Papatheodorou
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom.,Open Targets, Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| | - Andrew R Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, LiverpoolL69 7ZB, United Kingdom
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom.,Open Targets, Wellcome Genome Campus, Hinxton, CambridgeCB10 1SD, United Kingdom
| |
Collapse
|
20
|
Phlairaharn T, Ye Z, Krismer E, Pedersen AK, Pietzner M, Olsen JV, Schoof EM, Searle BC. Optimizing linear ion trap data independent acquisition towards single cell proteomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.21.529444. [PMID: 36865114 PMCID: PMC9980145 DOI: 10.1101/2023.02.21.529444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
A linear ion trap (LIT) is an affordable, robust mass spectrometer that proves fast scanning speed and high sensitivity, where its primary disadvantage is inferior mass accuracy compared to more commonly used time-of-flight (TOF) or orbitrap (OT) mass analyzers. Previous efforts to utilize the LIT for low-input proteomics analysis still rely on either built-in OTs for collecting precursor data or OT-based library generation. Here, we demonstrate the potential versatility of the LIT for low-input proteomics as a stand-alone mass analyzer for all mass spectrometry measurements, including library generation. To test this approach, we first optimized LIT data acquisition methods and performed library-free searches with and without entrapment peptides to evaluate both the detection and quantification accuracy. We then generated matrix-matched calibration curves to estimate the lower limit of quantification using only 10 ng of starting material. While LIT-MS1 measurements provided poor quantitative accuracy, LIT-MS2 measurements were quantitatively accurate down to 0.5 ng on column. Finally, we optimized a suitable strategy for spectral library generation from low-input material, which we used to analyze single-cell samples by LIT-DIA using LIT-based libraries generated from as few as 40 cells.
Collapse
|