1
|
Xiao Z, Tüshaus J, Kuster B, The M, Wilhelm M. SWAPS: A Modular Deep-Learning Empowered Peptide Identity Propagation Framework Beyond Match-Between-Run. J Proteome Res 2025; 24:1926-1940. [PMID: 40052690 PMCID: PMC11976850 DOI: 10.1021/acs.jproteome.4c00972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 02/04/2025] [Accepted: 02/21/2025] [Indexed: 04/05/2025]
Abstract
Mass spectrometry (MS)-based proteomics relies heavily on MS/MS (MS2) data, which do not fully exploit the available MS1 information. Traditional peptide identity propagation (PIP) methods, such as match-between-runs (MBR), are limited to similar runs, particularly with the same liquid chromatography (LC) gradients, thus potentially underutilizing available proteomics libraries. We introduce SWAPS, a novel and modular MS1-centric framework incorporating advances in peptide property prediction, extensive proteomics libraries, and deep-learning-based postprocessing to enable and explore PIP across more diverse experimental conditions and LC gradients. SWAPS substantially enhances precursor identification, especially in shorter gradients. On the example of 30, 15, and 7.5 min gradients, SWAPS achieves increases of 46.3, 86.2, and 112.1% on precursor level over MaxQuant's MS2-based identifications. Despite the inherent challenges in controlling false discovery rates (FDR) with MS1-based methods, SWAPS demonstrates strong efficacy in deconvoluting MS1 signals, offering powerful discrimination and deeper sequence exploration, while maintaining quantitative accuracy. By building on and applying peptide property predictions in practical contexts, SWAPS reveals that current models, while advanced, are still not fully comparable to experimental measurements, sparking the need for further research. Additionally, its modular design allows seamless integration of future improvements, positioning SWAPS as a forward-looking tool in proteomics.
Collapse
Affiliation(s)
- Zixuan Xiao
- Computational
Mass Spectrometry, School of Life Sciences, Technical University of Munich, Freising 85354, Germany
| | - Johanna Tüshaus
- Chair
of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising 85354, Germany
| | - Bernhard Kuster
- Chair
of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising 85354, Germany
- Munich
Data Science Institute (MDSI), Technical
University of Munich, Garching 85748, Germany
| | - Matthew The
- Chair
of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, Freising 85354, Germany
| | - Mathias Wilhelm
- Computational
Mass Spectrometry, School of Life Sciences, Technical University of Munich, Freising 85354, Germany
- Munich
Data Science Institute (MDSI), Technical
University of Munich, Garching 85748, Germany
| |
Collapse
|
2
|
Sanchez-Avila X, de Oliveira RM, Huang S, Wang C, Kelly RT. Trends in Mass Spectrometry-Based Single-Cell Proteomics. Anal Chem 2025; 97:5893-5907. [PMID: 40091206 PMCID: PMC12003028 DOI: 10.1021/acs.analchem.5c00661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Affiliation(s)
- Ximena Sanchez-Avila
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| | - Raphaela M de Oliveira
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| | - Siqi Huang
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| | - Chao Wang
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| | - Ryan T Kelly
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, Utah 84602, United States
| |
Collapse
|
3
|
Ivanov MV, Kopeykina AS, Kazakova EM, Tarasova IA, Sun Z, Postoenko VI, Yang J, Gorshkov MV. Modified Decision Tree with Custom Splitting Logic Improves Generalization across Multiple Brains' Proteomic Data Sets of Alzheimer's Disease. J Proteome Res 2025; 24:1053-1066. [PMID: 39984290 DOI: 10.1021/acs.jproteome.4c00677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2025]
Abstract
Many factors negatively affect a generalization of the findings in discovery proteomics. They include differentiation between patient cohorts, a variety of experimental conditions, etc. We presented a machine-learning-based workflow for proteomics data analysis, aiming at improving generalizability across multiple data sets. In particular, we customized the decision tree model by introducing a new parameter, min_groups_leaf, which regulates the presence of the samples from each data set inside the model's leaves. Further, we analyzed a trend for the feature importance's curve as a function of the novel parameter for feature selection to a list of proteins with significantly improved generalization. The developed workflow was tested using five proteomic data sets obtained for post-mortem human brain samples of Alzheimer's disease. The data sets consisted of 535 LC-MS/MS acquisition files. The results were obtained for two different pipelines of data processing: (1) MS1-only processing based on DirectMS1 search engine and (2) a standard MS/MS-based one. Using the developed workflow, we found seven proteins with expression patterns that were unique for asymptomatic Alzheimer patients. Two of them, Serotransferrin TRFE and DNA repair nuclease APEX1, may be potentially important for explaining the lack of dementia in patients with the presence of neuritic plaques and neurofibrillary tangles.
Collapse
Affiliation(s)
- Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Anna S Kopeykina
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Elizaveta M Kazakova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Irina A Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Zhao Sun
- Clinical Systems Biology Key Laboratory, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
- Institute of Infection and Immunity, Henan Academy of Innovations in Medical Science, Zhengzhou 450052, China
| | - Valeriy I Postoenko
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Jinghua Yang
- Clinical Systems Biology Key Laboratory, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450052, China
- Institute of Infection and Immunity, Henan Academy of Innovations in Medical Science, Zhengzhou 450052, China
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| |
Collapse
|
4
|
Frantsuzova E, Bogun A, Vetrova A, Kazakova E, Kusainova T, Tarasova I, Pozdnyakova-Filatova I, Delegan Y. Integrated Omics Approaches to Explore a New System of Genetic Control of Dibenzothiophene Desulfurization and Aromatic Ring Cleavage by Gordonia alkanivorans Strain 135. BIOLOGY 2025; 14:188. [PMID: 40001956 PMCID: PMC11852219 DOI: 10.3390/biology14020188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2025] [Revised: 02/05/2025] [Accepted: 02/10/2025] [Indexed: 02/27/2025]
Abstract
Dibenzothiophene (DBT) is a widespread environmental pollutant. The most common metabolic pathway for DBT degradation by Gordonia strains is the 4S pathway, which is under the control of the dsz operon. The ability to utilize DBT as the sole source of sulfur in Gordonia alkanivorans strain 135 has been revealed. The dsz operon was not detected in the genome of strain 135. In this work, using genomic, transcriptomic, and proteomic data of strain 135, it was shown that an alternative pathway of DBT transformation is possible in non-dsz Gordonia; the sfnB and tauD genes and two acyl-dehydrogenase genes are significantly involved in the desulfurization process.
Collapse
Affiliation(s)
- Ekaterina Frantsuzova
- Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of Russian Academy of Sciences” (FRC PSCBR RAS), Pushchino 142290, Russia; (E.F.); (A.B.); (A.V.); (I.P.-F.)
| | - Alexander Bogun
- Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of Russian Academy of Sciences” (FRC PSCBR RAS), Pushchino 142290, Russia; (E.F.); (A.B.); (A.V.); (I.P.-F.)
| | - Anna Vetrova
- Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of Russian Academy of Sciences” (FRC PSCBR RAS), Pushchino 142290, Russia; (E.F.); (A.B.); (A.V.); (I.P.-F.)
| | - Elizaveta Kazakova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia; (E.K.); (T.K.); (I.T.)
| | - Tomiris Kusainova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia; (E.K.); (T.K.); (I.T.)
| | - Irina Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia; (E.K.); (T.K.); (I.T.)
| | - Irina Pozdnyakova-Filatova
- Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of Russian Academy of Sciences” (FRC PSCBR RAS), Pushchino 142290, Russia; (E.F.); (A.B.); (A.V.); (I.P.-F.)
| | - Yanina Delegan
- Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of Russian Academy of Sciences” (FRC PSCBR RAS), Pushchino 142290, Russia; (E.F.); (A.B.); (A.V.); (I.P.-F.)
| |
Collapse
|
5
|
Solivais AJ, Boekweg H, Smith LM, Noble WS, Shortreed MR, Payne SH, Keich U. Improved detection of differentially abundant proteins through FDR-control of peptide-identity-propagation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.15.623880. [PMID: 39605422 PMCID: PMC11601340 DOI: 10.1101/2024.11.15.623880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
The goal of proteomics is to identify and quantify peptides and proteins within a biological sample. Almost all algorithms for the identification of peptides in LC-MS/MS data employ two steps: peptide/spectrum matching and peptide-identity-propagation (PIP), also known as match-between-runs. PIP was originally envisioned as a backup method to overcome measurement stochasticity. However, current PIP implementations can routinely account for up to 40% of all results, with that proportion rising as high as 75% in single-cell proteomics. Unlike peptide identities derived through peptide/spectrum matches, for which error estimation has been strictly enforced for decades, peptide identities derived through PIP have not historically been subject to statistical evaluation. As an indispensable component of label free quantification, PIP needs a simple and statistically rigorous method for estimating its error rates. Although several tools claim to control the false discovery rate (FDR) of PIP, these claims cannot be validated as there is currently no accepted method to assess the accuracy of the stated FDR. We present a method for FDR control of PIP, called PIP-ECHO, and devise a rigorous protocol for evaluating FDR control of any PIP method. Using three different benchmark datasets, we evaluate PIP-ECHO alongside the PIP procedures implemented by FlashLFQ, IonQuant, and MaxQuant. These analyses show that only PIP-ECHO can accurately control the FDR of PIP at 1% across all datasets, including single cell. When analyzing spike-in datasets where different known amounts of yeast or E. coli peptides are added to a constant background of human peptides, PIP-ECHO increases both the accuracy and sensitivity of differential expression analysis, yielding 53% more differentially abundant proteins than MaxQuant and 146% more than IonQuant.
Collapse
|
6
|
Urrutia M, Blein-Nicolas M, Fernandez O, Bernillon S, Maucourt M, Deborde C, Balliau T, Rabier D, Bénard C, Prigent S, Quilleré I, Jacob D, Gibon Y, Zivy M, Giauffret C, Hirel B, Moing A. Identification of metabolic and protein markers representative of the impact of mild nitrogen deficit on agronomic performance of maize hybrids. Metabolomics 2024; 20:128. [PMID: 39520587 PMCID: PMC11550246 DOI: 10.1007/s11306-024-02186-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 10/10/2024] [Indexed: 11/16/2024]
Abstract
INTRODUCTION A better understanding of the physiological response of silage maize to a mild reduction in nitrogen (N) fertilization and the identification of predictive biochemical markers of N utilization efficiency could contribute to limit the detrimental effect of the overuse of N inputs. OBJECTIVES We integrated phenotypic and biochemical data to interpret the physiology of maize in response to a mild reduction in N fertilization under agronomic conditions and identify predictive leaf metabolic and proteic markers that could be used to pilot and rationalize N fertilization. METHODS Eco-physiological, developmental and yield-related traits were measured and complemented with metabolomic and proteomic approaches performed on young leaves of a core panel of 29 European genetically diverse dent hybrids cultivated in the field under non-limiting and reduced N fertilization conditions. RESULTS Metabolome and proteome data were analyzed either individually or in an integrated manner together with eco-physiological, developmental, phenotypic and yield-related traits. They allowed to identify (i) common N-responsive metabolites and proteins that could be used as predictive markers to monitor N fertilization, (ii) silage maize hybrids that exhibit improved agronomic performance when N fertilization is reduced. CONCLUSIONS Among the N-responsive metabolites and proteins identified, a cytosolic NADP-dependent malic enzyme and four metabolite signatures stand out as promising markers that could be used for both breeding and agronomic purposes.
Collapse
Affiliation(s)
- Maria Urrutia
- INRAE, Université de Liège, Université de Lille, Université de Picardie Jules Verne, UMR BioEcoAgro, AgroImpact, Site d'Estrées Mons, 80203, Péronne, France
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Departamento de Mejora Genética y Biotecnología, Instituto de Hortofruticultura Subtropical y Mediterránea (IHSM), UMA-CSIC, Av. Luis Pasteur 49, 29071, Málaga, Spain
| | - Mélisande Blein-Nicolas
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, PAPPSO, 91190, Gif-Sur-Yvette, France
| | - Olivier Fernandez
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- University of Reims Champagne-Ardenne, Résistance Induite et Bioprotection des Plantes Research Unit, EA 4707, INRAE USC 1488, SFR Condorcet FR CNRS 3417, 51000, Reims, France
| | - Stéphane Bernillon
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Mickaël Maucourt
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Catherine Deborde
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Thierry Balliau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, PAPPSO, 91190, Gif-Sur-Yvette, France
| | | | - Camille Bénard
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Sylvain Prigent
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Isabelle Quilleré
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France
| | - Daniel Jacob
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Yves Gibon
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| | - Michel Zivy
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, PAPPSO, 91190, Gif-Sur-Yvette, France
| | - Catherine Giauffret
- INRAE, Université de Liège, Université de Lille, Université de Picardie Jules Verne, UMR BioEcoAgro, AgroImpact, Site d'Estrées Mons, 80203, Péronne, France
| | - Bertrand Hirel
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin (IJPB), 78000, Versailles, France.
| | - Annick Moing
- INRAE, Univ. Bordeaux, Biologie du Fruit et Pathologie, UMR1332, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
- Bordeaux Metabolome, MetaboHUB, PHENOME-EMPHASIS, Centre INRAE de Nouvelle-Aquitaine Bordeaux, 33140, Villenave d'Ornon, France
| |
Collapse
|
7
|
Xu W, Zhang L, Qian X, Sun N, Tu X, Zhou D, Zheng X, Chen J, Xie Z, He T, Qu S, Wang Y, Yang K, Su K, Feng S, Ju B. A deep learning framework for hepatocellular carcinoma diagnosis using MS1 data. Sci Rep 2024; 14:26705. [PMID: 39496730 PMCID: PMC11535524 DOI: 10.1038/s41598-024-77494-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 10/22/2024] [Indexed: 11/06/2024] Open
Abstract
Clinical proteomics analysis is of great significance for analyzing pathological mechanisms and discovering disease-related biomarkers. Using computational methods to accurately predict disease types can effectively improve patient disease diagnosis and prognosis. However, how to eliminate the errors introduced by peptide precursor identification and protein identification for pathological diagnosis remains a major unresolved issue. Here, we develop a powerful end-to-end deep learning model, termed "MS1Former", that is able to classify hepatocellular carcinoma tumors and adjacent non-tumor (normal) tissues directly using raw MS1 spectra without peptide precursor identification. Our model provides accurate discrimination of subtle m/z differences in MS1 between tumor and adjacent non-tumor tissue, as well as more general performance predictions for data-dependent acquisition, data-independent acquisition, and full-scan data. Our model achieves the best performance on multiple external validation datasets. Additionally, we perform a detailed exploration of the model's interpretability. Prospectively, we expect that the advanced end-to-end framework will be more applicable to the classification of other tumors.
Collapse
Affiliation(s)
- Wei Xu
- College of Basic Medical Science, Zhejiang Chinese Medical University, 548 Binwen Rd, Hangzhou, 310053, China
- Key Laboratory of Chinese Medicine Rheumatology of Zhejiang Province, 548 Binwen Rd, Hangzhou, 310053, China
| | - Liying Zhang
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Xiaoliang Qian
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Nannan Sun
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Xiao Tu
- College of Basic Medical Science, Zhejiang Chinese Medical University, 548 Binwen Rd, Hangzhou, 310053, China
- Key Laboratory of Zhejiang Province, Management of Kidney Disease, Hangzhou, 310000, China
| | - Dengfeng Zhou
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Xiaoping Zheng
- Pathology Department, Shulan (Hangzhou) Hospital, Hangzhou, China
| | - Jia Chen
- School of Life Sciences, Key Laboratory of Structural Biology of Zhejiang Province, Westlake University, Hangzhou, 310024, China
- The Biomedical Research Core Facility, Mass Spectrometry and Metabolomics Core Facility, Westlake University, Hangzhou, 310024, China
| | - Zewen Xie
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Tao He
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Shugang Qu
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China
| | - Yinjia Wang
- The First People's Hospital of Kunming, Intensive Care Unit, Kunming, 650032, China.
| | - Keda Yang
- Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou, 310015, China.
| | - Kunkai Su
- The First Affiliated Hospital, State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Zhejiang University School of Medicine, 79 Qingchun Road, Hangzhou, 310013, China.
| | - Shan Feng
- School of Life Sciences, Key Laboratory of Structural Biology of Zhejiang Province, Westlake University, Hangzhou, 310024, China.
- The Biomedical Research Core Facility, Mass Spectrometry and Metabolomics Core Facility, Westlake University, Hangzhou, 310024, China.
| | - Bin Ju
- SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China.
- Innovative Institute of Basic Medical Sciences, Zhejiang University, Hangzhou, 310022, Zhejiang, China.
| |
Collapse
|
8
|
Dai Y, Yang Y, Wu E, Shen C, Qiao L. Deep Learning Powers Protein Identification from Precursor MS Information. J Proteome Res 2024; 23:3837-3846. [PMID: 39167422 DOI: 10.1021/acs.jproteome.4c00118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2024]
Abstract
Proteome analysis currently heavily relies on tandem mass spectrometry (MS/MS), which does not fully utilize MS1 features, as many precursors remain unselected for MS/MS fragmentation, especially in the cases of low abundance samples and wide abundance dynamic range samples. Therefore, leveraging MS1 features as a complement to MS/MS has become an attractive option to improve the coverage of feature identification. Herein, we propose MonoMS1, an approach combining deep learning-based retention time, ion mobility, detectability prediction, and logistic regression-based scoring for MS1 feature identification. The approach achieved a significant increase in MS1 feature identification based on an E. coli data set. Application of MonoMS1 to data sets with wide dynamic range, such as human serum proteome samples, and with low sample abundance, such as single-cell proteome samples, enabled substantial complementation of MS/MS-based peptide and protein identification. This method opens a new avenue for proteomic analysis and can boost proteomic research on complex samples.
Collapse
Affiliation(s)
- Yameng Dai
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| | - Yi Yang
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311200, China
| | - Enhui Wu
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| | - Chengpin Shen
- Shanghai Omicsolution Co., Ltd., Shanghai 201100, China
| | - Liang Qiao
- Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China
| |
Collapse
|
9
|
Jiang Y, Rex DA, Schuster D, Neely BA, Rosano GL, Volkmar N, Momenzadeh A, Peters-Clarke TM, Egbert SB, Kreimer S, Doud EH, Crook OM, Yadav AK, Vanuopadath M, Hegeman AD, Mayta M, Duboff AG, Riley NM, Moritz RL, Meyer JG. Comprehensive Overview of Bottom-Up Proteomics Using Mass Spectrometry. ACS MEASUREMENT SCIENCE AU 2024; 4:338-417. [PMID: 39193565 PMCID: PMC11348894 DOI: 10.1021/acsmeasuresciau.3c00068] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 05/03/2024] [Accepted: 05/03/2024] [Indexed: 08/29/2024]
Abstract
Proteomics is the large scale study of protein structure and function from biological systems through protein identification and quantification. "Shotgun proteomics" or "bottom-up proteomics" is the prevailing strategy, in which proteins are hydrolyzed into peptides that are analyzed by mass spectrometry. Proteomics studies can be applied to diverse studies ranging from simple protein identification to studies of proteoforms, protein-protein interactions, protein structural alterations, absolute and relative protein quantification, post-translational modifications, and protein stability. To enable this range of different experiments, there are diverse strategies for proteome analysis. The nuances of how proteomic workflows differ may be challenging to understand for new practitioners. Here, we provide a comprehensive overview of different proteomics methods. We cover from biochemistry basics and protein extraction to biological interpretation and orthogonal validation. We expect this Review will serve as a handbook for researchers who are new to the field of bottom-up proteomics.
Collapse
Affiliation(s)
- Yuming Jiang
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| | - Devasahayam Arokia
Balaya Rex
- Center for
Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Dina Schuster
- Department
of Biology, Institute of Molecular Systems
Biology, ETH Zurich, Zurich 8093, Switzerland
- Department
of Biology, Institute of Molecular Biology
and Biophysics, ETH Zurich, Zurich 8093, Switzerland
- Laboratory
of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen 5232, Switzerland
| | - Benjamin A. Neely
- Chemical
Sciences Division, National Institute of
Standards and Technology, NIST, Charleston, South Carolina 29412, United States
| | - Germán L. Rosano
- Mass
Spectrometry
Unit, Institute of Molecular and Cellular
Biology of Rosario, Rosario, 2000 Argentina
| | - Norbert Volkmar
- Department
of Biology, Institute of Molecular Systems
Biology, ETH Zurich, Zurich 8093, Switzerland
| | - Amanda Momenzadeh
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| | - Trenton M. Peters-Clarke
- Department
of Pharmaceutical Chemistry, University
of California—San Francisco, San Francisco, California, 94158, United States
| | - Susan B. Egbert
- Department
of Chemistry, University of Manitoba, Winnipeg, Manitoba, R3T 2N2 Canada
| | - Simion Kreimer
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| | - Emma H. Doud
- Center
for Proteome Analysis, Indiana University
School of Medicine, Indianapolis, Indiana, 46202-3082, United States
| | - Oliver M. Crook
- Oxford
Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, United
Kingdom
| | - Amit Kumar Yadav
- Translational
Health Science and Technology Institute, NCR Biotech Science Cluster 3rd Milestone Faridabad-Gurgaon
Expressway, Faridabad, Haryana 121001, India
| | | | - Adrian D. Hegeman
- Departments
of Horticultural Science and Plant and Microbial Biology, University of Minnesota, Twin Cities, Minnesota 55108, United States
| | - Martín
L. Mayta
- School
of Medicine and Health Sciences, Center for Health Sciences Research, Universidad Adventista del Plata, Libertador San Martin 3103, Argentina
- Molecular
Biology Department, School of Pharmacy and Biochemistry, Universidad Nacional de Rosario, Rosario 2000, Argentina
| | - Anna G. Duboff
- Department
of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Nicholas M. Riley
- Department
of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Robert L. Moritz
- Institute
for Systems biology, Seattle, Washington 98109, United States
| | - Jesse G. Meyer
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| |
Collapse
|
10
|
Fedorov II, Protasov SA, Tarasova IA, Gorshkov MV. Ultrafast Proteomics. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:1349-1361. [PMID: 39245450 DOI: 10.1134/s0006297924080017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 06/21/2024] [Accepted: 06/24/2024] [Indexed: 09/10/2024]
Abstract
Current stage of proteomic research in the field of biology, medicine, development of new drugs, population screening, or personalized approaches to therapy dictates the need to analyze large sets of samples within the reasonable experimental time. Until recently, mass spectrometry measurements in proteomics were characterized as unique in identifying and quantifying cellular protein composition, but low throughput, requiring many hours to analyze a single sample. This was in conflict with the dynamics of changes in biological systems at the whole cellular proteome level upon the influence of external and internal factors. Thus, low speed of the whole proteome analysis has become the main factor limiting developments in functional proteomics, where it is necessary to annotate intracellular processes not only in a wide range of conditions, but also over a long period of time. Enormous level of heterogeneity of tissue cells or tumors, even of the same type, dictates the need to analyze biological systems at the level of individual cells. These studies involve obtaining molecular characteristics for tens, if not hundreds of thousands of individual cells, including their whole proteome profiles. Development of mass spectrometry technologies providing high resolution and mass measurement accuracy, predictive chromatography, new methods for peptide separation by ion mobility and processing of proteomic data based on artificial intelligence algorithms have opened a way for significant, if not radical, increase in the throughput of whole proteome analysis and led to implementation of the novel concept of ultrafast proteomics. Work done just in the last few years has demonstrated the proteome-wide analysis throughput of several hundred samples per day at a depth of several thousand proteins, levels unimaginable three or four years ago. The review examines background of these developments, as well as modern methods and approaches that implement ultrafast analysis of the entire proteome.
Collapse
Affiliation(s)
- Ivan I Fedorov
- Moscow Institute of Physics and Technology (National University), Dolgoprudny, Moscow Region, 141700, Russia
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Sergey A Protasov
- Moscow Institute of Physics and Technology (National University), Dolgoprudny, Moscow Region, 141700, Russia
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Irina A Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia.
| |
Collapse
|
11
|
Jiang Y, DeBord D, Vitrac H, Stewart J, Haghani A, Van Eyk JE, Fert-Bober J, Meyer JG. The Future of Proteomics is Up in the Air: Can Ion Mobility Replace Liquid Chromatography for High Throughput Proteomics? J Proteome Res 2024; 23:1871-1882. [PMID: 38713528 PMCID: PMC11161313 DOI: 10.1021/acs.jproteome.4c00248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2024]
Abstract
The coevolution of liquid chromatography (LC) with mass spectrometry (MS) has shaped contemporary proteomics. LC hyphenated to MS now enables quantification of more than 10,000 proteins in a single injection, a number that likely represents most proteins in specific human cells or tissues. Separations by ion mobility spectrometry (IMS) have recently emerged to complement LC and further improve the depth of proteomics. Given the theoretical advantages in speed and robustness of IMS in comparison to LC, we envision that ongoing improvements to IMS paired with MS may eventually make LC obsolete, especially when combined with targeted or simplified analyses, such as rapid clinical proteomics analysis of defined biomarker panels. In this perspective, we describe the need for faster analysis that might drive this transition, the current state of direct infusion proteomics, and discuss some technical challenges that must be overcome to fully complete the transition to entirely gas phase proteomics.
Collapse
Affiliation(s)
- Yuming Jiang
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Daniel DeBord
- MOBILion Systems Inc., Chadds Ford, Pennsylvania 19317, United States
| | - Heidi Vitrac
- MOBILion Systems Inc., Chadds Ford, Pennsylvania 19317, United States
| | - Jordan Stewart
- MOBILion Systems Inc., Chadds Ford, Pennsylvania 19317, United States
| | - Ali Haghani
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Jennifer E Van Eyk
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Justyna Fert-Bober
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Jesse G Meyer
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
- The Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| |
Collapse
|
12
|
Xu A, Tang LC, Jovanovic M, Regev O. Uncovering Distinct Peptide Charging Behaviors in Electrospray Ionization Mass Spectrometry Using a Large-Scale Dataset. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:90-99. [PMID: 38095561 PMCID: PMC10767741 DOI: 10.1021/jasms.3c00325] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/20/2023] [Accepted: 11/22/2023] [Indexed: 12/26/2023]
Abstract
Electrospray ionization is a powerful and prevalent technique used to ionize analytes in mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting mass spectra. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based approach and analyze CSDs of hundreds of thousands of peptides. Interestingly, half of the peptides exhibit charges that differ from what one would naively expect (the number of basic sites). We find that these peptides can be classified into two regimes (undercharging and overcharging) and that these two regimes display markedly different charging characteristics. Notably, peptides in the overcharging regime show minimal dependence on basic site count, and more generally, the two regimes exhibit distinct sequence determinants. These findings highlight the rich ionization behavior of peptides and the potential of CSDs for enhancing peptide identification.
Collapse
Affiliation(s)
- Allyn
M. Xu
- Department
of Mathematics, Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| | - Lauren C. Tang
- Department
of Biological Sciences, Columbia University, New York, New York 10027, United States
| | - Marko Jovanovic
- Department
of Biological Sciences, Columbia University, New York, New York 10027, United States
| | - Oded Regev
- Computer
Science Department, Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| |
Collapse
|
13
|
Kusainova TT, Emekeeva DD, Kazakova EM, Gorshkov VA, Kjeldsen F, Kuskov ML, Zhigach AN, Olkhovskaya IP, Bogoslovskaya OA, Glushchenko NN, Tarasova IA. Ultra-Fast Mass Spectrometry in Plant Biochemistry: Response of Winter Wheat Proteomics to Pre-Sowing Treatment with Iron Compounds. BIOCHEMISTRY. BIOKHIMIIA 2023; 88:1390-1403. [PMID: 37770405 DOI: 10.1134/s0006297923090183] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 08/06/2023] [Accepted: 08/09/2023] [Indexed: 09/30/2023]
Abstract
In recent years, ultrafast liquid chromatography/mass spectrometry methods have been extensively developed for the use in proteome profiling in biochemical studies. These methods are intended for express monitoring of cell response to biotic stimuli and elucidation of correlation of molecular changes with biological processes and phenotypical changes. New technologies, including the use of nanomaterials, are actively introduced to increase agricultural production. However, this requires complex approbation of new fertilizers and investigation of mechanisms underlying the biotic effects on the germination, growth, and development of plants. The aim of this work was to adapt the method of ultrafast chromatography/mass spectrometry for rapid quantitative profiling of molecular changes in 7-day-old wheat seedlings in response to pre-sowing seed treatment with iron compounds. The used method allows to analyze up to 200 samples per day; its practical value lies in the possibility of express proteomic diagnostics of the biotic action of new treatments, including those intended for agricultural needs. Changes in the regulation of photosynthesis, biosynthesis of chlorophyll and porphyrin- and tetrapyrrole-containing compounds, glycolysis (in shoot tissues), and polysaccharide metabolism (in root tissues) were shown after seed treatment with suspensions containing film-forming polymers (PEG 400, Na-CMC, Na2-EDTA), iron (II, III) nanoparticles, or iron (II) sulfate. Observations at the protein levels were consistent with the results of morphometry, superoxide dismutase activity assay, and microelement analysis of 3-day-old germinated seeds and shoots and roots of 7-day-old seedlings. A characteristic molecular signature involving proteins participating in the regulation of photosynthesis and glycolytic process was suggested as a potential marker of the biotic effects of seed treatment with iron compounds, which will be confirmed in further studies.
Collapse
Affiliation(s)
- Tomiris T Kusainova
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Daria D Emekeeva
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Elizaveta M Kazakova
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| | - Vladimir A Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, DK-5230, Denmark
| | - Frank Kjeldsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, DK-5230, Denmark
| | - Mikhail L Kuskov
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Alexey N Zhigach
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Irina P Olkhovskaya
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Olga A Bogoslovskaya
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Natalia N Glushchenko
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Irina A Tarasova
- Talroze Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia.
| |
Collapse
|
14
|
Postoenko VI, Garibova LA, Levitsky LI, Bubis JA, Gorshkov MV, Ivanov MV. IQMMA: Efficient MS1 Intensity Extraction Pipeline Using Multiple Feature Detection Algorithms for DDA Proteomics. J Proteome Res 2023; 22:2827-2835. [PMID: 37579078 DOI: 10.1021/acs.jproteome.3c00075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2023]
Abstract
One of the key steps in data dependent acquisition (DDA) proteomics is detection of peptide isotopic clusters, also called "features", in MS1 spectra and matching them to MS/MS-based peptide identifications. A number of peptide feature detection tools became available in recent years, each relying on its own matching algorithm. Here, we provide an integrated solution, the intensity-based Quantitative Mix and Match Approach (IQMMA), which integrates a number of untargeted peptide feature detection algorithms and returns the most probable intensity values for the MS/MS-based identifications. IQMMA was tested using available proteomic data acquired for both well-characterized (ground truth) and real-world biological samples, including a mix of Yeast and E. coli digests spiked at different concentrations into the Human K562 digest used as a background, and a set of glioblastoma cell lines. Three open-source feature detection algorithms were integrated: Dinosaur, biosaur2, and OpenMS FeatureFinder. None of them was found optimal when applied individually to all the data sets employed in this work; however, their combined use in IQMMA improved efficiency of subsequent protein quantitation. The software implementing IQMMA is freely available at https://github.com/PostoenkoVI/IQMMA under Apache 2.0 license.
Collapse
Affiliation(s)
- Valeriy I Postoenko
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
- Moscow Institute of Physics and Technology, National Research University, G. Dolgoprudny, Institutsky Lane 9, Dolgoprudny 141701, Russia
| | - Leyla A Garibova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
- Moscow Institute of Physics and Technology, National Research University, G. Dolgoprudny, Institutsky Lane 9, Dolgoprudny 141701, Russia
| | - Lev I Levitsky
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| | - Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia
| |
Collapse
|
15
|
Penanes P, Gorshkov V, Ivanov MV, Gorshkov MV, Kjeldsen F. Potential of Negative-Ion-Mode Proteomics: An MS1-Only Approach. J Proteome Res 2023; 22:2734-2742. [PMID: 37395192 PMCID: PMC10407931 DOI: 10.1021/acs.jproteome.3c00307] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Indexed: 07/04/2023]
Abstract
Current proteomics approaches rely almost exclusively on using the positive ionization mode, resulting in inefficient ionization of many acidic peptides. This study investigates protein identification efficiency in the negative ionization mode using the DirectMS1 method. DirectMS1 is an ultrafast data acquisition method based on accurate peptide mass measurements and predicted retention times. Our method achieves the highest rate of protein identification in the negative ion mode to date, identifying over 1000 proteins in a human cell line at a 1% false discovery rate. This is accomplished using a single-shot 10 min separation gradient, comparable to lengthy MS/MS-based analyses. Optimizing separation and experimental conditions was achieved by utilizing mobile buffers containing 2.5 mM imidazole and 3% isopropanol. The study emphasized the complementary nature of data obtained in positive and negative ion modes. Combining the results from all replicates in both polarities increased the number of identified proteins to 1774. Additionally, we analyzed the method's efficiency using different proteases for protein digestion. Among the four studied proteases (LysC, GluC, AspN, and trypsin), trypsin and LysC demonstrated the highest protein identification yield. This suggests that digestion procedures utilized in positive-mode proteomics can be effectively applied in the negative ion mode. Data are deposited to ProteomeXchange: PXD040583.
Collapse
Affiliation(s)
- Pelayo
A. Penanes
- Department
of Biochemistry and Molecular Biology, University
of Southern Denmark, DK-5230 Odense M, Denmark
| | - Vladimir Gorshkov
- Department
of Biochemistry and Molecular Biology, University
of Southern Denmark, DK-5230 Odense M, Denmark
| | - Mark V. Ivanov
- V.
L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical
Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| | - Mikhail V. Gorshkov
- V.
L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical
Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| | - Frank Kjeldsen
- Department
of Biochemistry and Molecular Biology, University
of Southern Denmark, DK-5230 Odense M, Denmark
| |
Collapse
|
16
|
Matzinger M, Mayer RL, Mechtler K. Label-free single cell proteomics utilizing ultrafast LC and MS instrumentation: A valuable complementary technique to multiplexing. Proteomics 2023; 23:e2200162. [PMID: 36806919 PMCID: PMC10909491 DOI: 10.1002/pmic.202200162] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/21/2023]
Abstract
The ability to map a proteomic fingerprint to transcriptomic data would master the understanding of how gene expression translates into actual phenotype. In contrast to nucleic acid sequencing, in vitro protein amplification is impossible and no single cell proteomic workflow has been established as gold standard yet. Advances in microfluidic sample preparation, multi-dimensional sample separation, sophisticated data acquisition strategies, and intelligent data analysis algorithms have resulted in major improvements to successfully analyze such tiny sample amounts with steadily boosted performance. However, among the broad variation of published approaches, it is commonly accepted that highest possible sensitivity, robustness, and throughput are still the most urgent needs for the field. While many labs have focused on multiplexing to achieve these goals, label-free SCP is a highly promising strategy as well whenever high dynamic range and unbiased accurate quantification are needed. We here focus on recent advances in label-free single-cell mass spectrometry workflows and try to guide our readers to choose the best method or combinations of methods for their specific applications. We further highlight which techniques are most propitious in the future and which applications but also limitations we foresee for the field.
Collapse
Affiliation(s)
- Manuel Matzinger
- Research Institute of Molecular Pathology (IMP)Vienna BioCenterViennaAustria
| | - Rupert L. Mayer
- Research Institute of Molecular Pathology (IMP)Vienna BioCenterViennaAustria
| | - Karl Mechtler
- Research Institute of Molecular Pathology (IMP)Vienna BioCenterViennaAustria
- Gregor Mendel Institute of Molecular Plant Biology (GMI), Austrian Academy of SciencesVienna BioCenter (VBC)ViennaAustria
- Institute of Molecular Biotechnology (IMBA), Austrian Academy of SciencesVienna BioCenter (VBC)ViennaAustria
| |
Collapse
|
17
|
Xu AM, Tang LC, Jovanovic M, Regev O. A high-throughput approach reveals distinct peptide charging behaviors in electrospray ionization mass spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.31.535171. [PMID: 37066236 PMCID: PMC10103939 DOI: 10.1101/2023.03.31.535171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Electrospray ionization is a powerful and prevalent technique used to ionize analytes in mass spectrometry. The distribution of charges that an analyte receives (charge state distribution, CSD) is an important consideration for interpreting mass spectra. However, due to an incomplete understanding of the ionization mechanism, the analyte properties that influence CSDs are not fully understood. Here, we employ a machine learning-based high-throughput approach and analyze CSDs of hundreds of thousands of peptides. Interestingly, half of the peptides exhibit charges that differ from what one would naively expect (number of basic sites). We find that these peptides can be classified into two regimes-undercharging and overcharging-and that these two regimes display markedly different charging characteristics. Strikingly, peptides in the overcharging regime show minimal dependence on basic site count, and more generally, the two regimes exhibit distinct sequence determinants. These findings highlight the rich ionization behavior of peptides and the potential of CSDs for enhancing peptide identification.
Collapse
Affiliation(s)
- Allyn M. Xu
- Department of Mathematics, Courant Institute of Mathematical Sciences, New York University, NY, USA
| | - Lauren C. Tang
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Oded Regev
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, NY, USA
| |
Collapse
|
18
|
Matzinger M, Müller E, Dürnberger G, Pichler P, Mechtler K. Robust and Easy-to-Use One-Pot Workflow for Label-Free Single-Cell Proteomics. Anal Chem 2023; 95:4435-4445. [PMID: 36802514 PMCID: PMC9996606 DOI: 10.1021/acs.analchem.2c05022] [Citation(s) in RCA: 71] [Impact Index Per Article: 35.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 02/07/2023] [Indexed: 02/22/2023]
Abstract
The analysis of ultralow input samples or even individual cells is essential to answering a multitude of biomedical questions, but current proteomic workflows are limited in their sensitivity and reproducibility. Here, we report a comprehensive workflow that includes improved strategies for all steps, from cell lysis to data analysis. Thanks to convenient-to-handle 1 μL sample volume and standardized 384-well plates, the workflow is easy for even novice users to implement. At the same time, it can be performed semi-automatized using CellenONE, which allows for the highest reproducibility. To achieve high throughput, ultrashort gradient lengths down to 5 min were tested using advanced μ-pillar columns. Data-dependent acquisition (DDA), wide-window acquisition (WWA), data-independent acquisition (DIA), and commonly used advanced data analysis algorithms were benchmarked. Using DDA, 1790 proteins covering a dynamic range of four orders of magnitude were identified in a single cell. Using DIA, proteome coverage increased to more than 2200 proteins identified from single-cell level input in a 20 min active gradient. The workflow enabled differentiation of two cell lines, demonstrating its suitability to cellular heterogeneity determination.
Collapse
Affiliation(s)
- Manuel Matzinger
- Institute
of Molecular Pathology (IMP), Campus-Vienna-Biocenter 1, 1030 Vienna, Austria
| | - Elisabeth Müller
- Institute
of Molecular Pathology (IMP), Campus-Vienna-Biocenter 1, 1030 Vienna, Austria
| | - Gerhard Dürnberger
- Gregor
Mendel Institute of Molecular Plant Biology (GMI) of the Austrian
Academy of Sciences, Dr. Bohrgasse 3, 1030 Vienna, Austria
| | - Peter Pichler
- Institute
of Molecular Pathology (IMP), Campus-Vienna-Biocenter 1, 1030 Vienna, Austria
| | - Karl Mechtler
- Institute
of Molecular Pathology (IMP), Campus-Vienna-Biocenter 1, 1030 Vienna, Austria
- Institute
of Molecular Biotechnology of the Austrian Academy of Sciences, Dr. Bohrgasse 3, 1030 Vienna, Austria
- Gregor
Mendel Institute of Molecular Plant Biology (GMI) of the Austrian
Academy of Sciences, Dr. Bohrgasse 3, 1030 Vienna, Austria
| |
Collapse
|
19
|
Kazakova EM, Solovyeva EM, Levitsky LI, Bubis JA, Emekeeva DD, Antonets AA, Nazarov AA, Gorshkov MV, Tarasova IA. Proteomics-based scoring of cellular response to stimuli for improved characterization of signaling pathway activity. Proteomics 2023; 23:e2200275. [PMID: 36478387 DOI: 10.1002/pmic.202200275] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 11/04/2022] [Accepted: 11/25/2022] [Indexed: 12/12/2022]
Abstract
Omics technologies focus on uncovering the complex nature of molecular mechanisms in cells and organisms, including biomarkers and drug targets discovery. Aiming at these tasks, we see that information extracted from omics data is still underused. In particular, characteristics of differentially regulated molecules can be combined in a single score to quantify the signaling pathway activity. Such a metric can be useful for comprehensive data interpretation to follow: (1) developing molecular responses in time; (2) potency of a drug on a certain cell culture; (3) ranking the signaling pathway activity in stimulated cells; and (4) integration of the omics data and assay-based measurements of cell viability, cytotoxicity, and proliferation. With recent advances in ultrafast mass spectrometry for quantitative proteomics allowing data collection in a few minutes, proteomics score for cellular response to stimuli can become a fast, accurate, and informative complement to bioassays. Here, we utilized an interquartile-based selection of differentially regulated features and a variety of schemes for quantifying cellular responses to come up with the quantitative metric for total cellular response and pathway activity. Validation was performed using antiproliferative and virus assays and label-free proteomics data collected for cancer cells subjected to drug stimulation.
Collapse
Affiliation(s)
- Elizaveta M Kazakova
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - Elizaveta M Solovyeva
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - Lev I Levitsky
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - Julia A Bubis
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - Daria D Emekeeva
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - Anastasia A Antonets
- Department of Chemistry, M. V. Lomonosov Moscow State University, Moscow, Russia
| | - Alexey A Nazarov
- Department of Chemistry, M. V. Lomonosov Moscow State University, Moscow, Russia
| | - Mikhail V Gorshkov
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | - Irina A Tarasova
- V.L. Talrose Institute for Energy Problems of Chemical Physics, Federal Research Center of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
20
|
Nickerson JL, Baghalabadi V, Rajendran SRCK, Jakubec PJ, Said H, McMillen TS, Dang Z, Doucette AA. Recent advances in top-down proteome sample processing ahead of MS analysis. MASS SPECTROMETRY REVIEWS 2023; 42:457-495. [PMID: 34047392 DOI: 10.1002/mas.21706] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 04/21/2021] [Accepted: 05/06/2021] [Indexed: 06/12/2023]
Abstract
Top-down proteomics is emerging as a preferred approach to investigate biological systems, with objectives ranging from the detailed assessment of a single protein therapeutic, to the complete characterization of every possible protein including their modifications, which define the human proteoform. Given the controlling influence of protein modifications on their biological function, understanding how gene products manifest or respond to disease is most precisely achieved by characterization at the intact protein level. Top-down mass spectrometry (MS) analysis of proteins entails unique challenges associated with processing whole proteins while maintaining their integrity throughout the processes of extraction, enrichment, purification, and fractionation. Recent advances in each of these critical front-end preparation processes, including minimalistic workflows, have greatly expanded the capacity of MS for top-down proteome analysis. Acknowledging the many contributions in MS technology and sample processing, the present review aims to highlight the diverse strategies that have forged a pathway for top-down proteomics. We comprehensively discuss the evolution of front-end workflows that today facilitate optimal characterization of proteoform-driven biology, including a brief description of the clinical applications that have motivated these impactful contributions.
Collapse
Affiliation(s)
| | - Venus Baghalabadi
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Subin R C K Rajendran
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
- Verschuren Centre for Sustainability in Energy and the Environment, Sydney, Nova Scotia, Canada
| | - Philip J Jakubec
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Hammam Said
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Teresa S McMillen
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Ziheng Dang
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Alan A Doucette
- Department of Chemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
21
|
Jiang Y, Hutton A, Cranney CW, Meyer JG. Label-Free Quantification from Direct Infusion Shotgun Proteome Analysis (DISPA-LFQ) with CsoDIAq Software. Anal Chem 2023; 95:677-685. [PMID: 36527718 PMCID: PMC9850400 DOI: 10.1021/acs.analchem.2c02249] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 10/31/2022] [Indexed: 12/23/2022]
Abstract
Large-scale proteome analysis requires rapid and high-throughput analytical methods. We recently reported a new paradigm in proteome analysis where direct infusion and ion mobility are used instead of liquid chromatography (LC) to achieve rapid and high-throughput proteome analysis. Here, we introduce an improved direct infusion shotgun proteome analysis protocol including label-free quantification (DISPA-LFQ) using CsoDIAq software. With CsoDIAq analysis of DISPA data, we can now identify up to ∼2000 proteins from the HeLa and 293T proteomes, and with DISPA-LFQ, we can quantify ∼1000 proteins from no more than 1 μg of sample within minutes. The identified proteins are involved in numerous valuable pathways including central carbon metabolism, nucleic acid replication and transport, protein synthesis, and endocytosis. Together with a high-throughput sample preparation method in a 96-well plate, we further demonstrate the utility of this technology for performing high-throughput drug analysis in human 293T cells. The total time for data collection from a whole 96-well plate is approximately 8 h. We conclude that the DISPA-LFQ strategy presents a valuable tool for fast identification and quantification of proteins in complex mixtures, which will power a high-throughput proteomic era of drug screening, biomarker discovery, and clinical analysis.
Collapse
Affiliation(s)
- Yuming Jiang
- Department
of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
| | - Alexandre Hutton
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
| | - Caleb W. Cranney
- Department
of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
| | - Jesse G. Meyer
- Department
of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, United States
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
| |
Collapse
|
22
|
Kliuchnikova AA, Novikova SE, Ilgisonis EV, Kiseleva OI, Poverennaya EV, Zgoda VG, Moshkovskii SA, Poroikov VV, Lisitsa AV, Archakov AI, Ponomarenko EA. Blood Plasma Proteome: A Meta-Analysis of the Results of Protein Quantification in Human Blood by Targeted Mass Spectrometry. Int J Mol Sci 2023; 24:ijms24010769. [PMID: 36614211 PMCID: PMC9821253 DOI: 10.3390/ijms24010769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 12/14/2022] [Accepted: 12/27/2022] [Indexed: 01/04/2023] Open
Abstract
A meta-analysis of the results of targeted quantitative screening of human blood plasma was performed to generate a reference standard kit that can be used for health analytics. The panel included 53 of the 296 proteins that form a “stable” part of the proteome of a healthy individual; these proteins were found in at least 70% of samples and were characterized by an interindividual coefficient of variation <40%. The concentration range of the selected proteins was 10−10−10−3 M and enrichment analysis revealed their association with rare familial diseases. The concentration of ceruloplasmin was reduced by approximately three orders of magnitude in patients with neurological disorders compared to healthy volunteers, and those of gelsolin isoform 1 and complement factor H were abruptly reduced in patients with lung adenocarcinoma. Absolute quantitative data of the individual proteome of a healthy and diseased individual can be used as the basis for personalized medicine and health monitoring. Storage over time allows us to identify individual biomarkers in the molecular landscape and prevent pathological conditions.
Collapse
Affiliation(s)
- Anna A. Kliuchnikova
- Institute of Biomedical Chemistry, 119121 Moscow, Russia
- Federal Research and Clinical Center of Physical-Chemical Medicine, 119435 Moscow, Russia
| | | | | | | | | | | | - Sergei A. Moshkovskii
- Federal Research and Clinical Center of Physical-Chemical Medicine, 119435 Moscow, Russia
- Department of Biochemistry, Medico-Biological Faculty, Pirogov Russian National Research Medical University, 117997 Moscow, Russia
| | | | | | | | | |
Collapse
|
23
|
Solovyeva EM, Bubis JA, Tarasova IA, Lobas AA, Ivanov MV, Nazarov AA, Shutkov IA, Gorshkov MV. On the Feasibility of Using an Ultra-Fast DirectMS1 Method of Proteome-Wide Analysis for Searching Drug Targets in Chemical Proteomics. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:1342-1353. [PMID: 36509723 DOI: 10.1134/s000629792211013x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Protein quantitation in tissue cells or physiological fluids based on liquid chromatography/mass spectrometry is one of the key sources of information on the mechanisms of cell functioning during chemotherapeutic treatment. Information on significant changes in protein expression upon treatment can be obtained by chemical proteomics and requires analysis of the cellular proteomes, as well as development of experimental and bioinformatic methods for identification of the drug targets. Low throughput of whole proteome analysis based on liquid chromatography and tandem mass spectrometry is one of the main factors limiting the scale of these studies. The method of direct mass spectrometric identification of proteins, DirectMS1, is one of the approaches developed in recent years allowing ultrafast proteome-wide analyses employing minute-scale gradients for separation of proteolytic mixtures. Aim of this work was evaluation of both possibilities and limitations of the method for identification of drug targets at the level of whole proteome and for revealing cellular processes activated by the treatment. Particularly, the available literature data on chemical proteomics obtained earlier for a large set of onco-pharmaceuticals using multiplex quantitative proteome profiling were analyzed. The results obtained were further compared with the proteome-wide data acquired by the DirectMS1 method using ultrashort separation gradients to evaluate efficiency of the method in identifying known drug targets. Using ovarian cancer cell line A2780 as an example, a whole-proteome comparison of two cell lysis techniques was performed, including the freeze-thaw lysis commonly employed in chemical proteomics and the one based on ultrasonication for cell disruption, which is the widely accepted as a standard in proteomic studies. Also, the proteome-wide profiling was performed using ultrafast DirectMS1 method for A2780 cell line treated with lonidamine, followed by gene ontology analyses to evaluate capabilities of the method in revealing regulation of proteins in the cellular processes associated with drug treatment.
Collapse
Affiliation(s)
- Elizaveta M Solovyeva
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Irina A Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Anna A Lobas
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Alexey A Nazarov
- Faculty of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Ilya A Shutkov
- Faculty of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia.
| |
Collapse
|
24
|
Viral Biomarker Detection and Validation Using MALDI Mass Spectrometry Imaging (MSI). Proteomes 2022; 10:proteomes10030033. [PMID: 36136311 PMCID: PMC9506211 DOI: 10.3390/proteomes10030033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 08/18/2022] [Accepted: 09/05/2022] [Indexed: 11/21/2022] Open
Abstract
(1) Background: MALDI imaging is a technique that still largely depends on time of flight (TOF)-based instrument such as the Bruker UltrafleXtreme. While capable of performing targeted MS/MS, these instruments are unable to perform fragmentation while imaging a tissue section necessitating the reliance of MS1 values for peptide level identifications. With this premise in mind, we have developed a hybrid bioinformatic/image-based method for the identification and validation of viral biomarkers. (2) Methods: Formalin-Fixed Paraffin-Embedded (FFPE) mouse samples were sectioned, mounted and prepared for mass spectrometry imaging using our well-established methods. Peptide identification was achieved by first extracting confident images corresponding to theoretical viral peptides. Next, those masses were used to perform a Peptide Mmass Fingerprint (PMF) searched against known viral FASTA sequences against a background mouse FASTA database. Finally, a correlational analysis was performed with imaging data to confirm pixel-by-pixel colocalization and intensity of viral peptides. (3) Results: 14 viral peptides were successfully identified with significant PMF Scores and a correlational result of >0.79 confirming the presence of the virus and distinguishing it from the background mouse proteins. (4) Conclusions: this novel approach leverages the power of mass spectrometry imaging and provides confident identifications for viral proteins without requiring MS/MS using simple MALDI Time Of Flight/Time Of Flight (TOF/TOF) instrumentation.
Collapse
|
25
|
Ivanov MV, Bubis JA, Gorshkov V, Tarasova IA, Levitsky LI, Solovyeva EM, Lipatova AV, Kjeldsen F, Gorshkov MV. DirectMS1Quant: Ultrafast Quantitative Proteomics with MS/MS-Free Mass Spectrometry. Anal Chem 2022; 94:13068-13075. [PMID: 36094425 DOI: 10.1021/acs.analchem.2c02255] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Recently, we presented the DirectMS1 method of ultrafast proteome-wide analysis based on minute-long LC gradients and MS1-only mass spectra acquisition. Currently, the method provides the depth of human cell proteome coverage of 2500 proteins at a 1% false discovery rate (FDR) when using 5 min LC gradients and 7.3 min runtime in total. While the standard MS/MS approaches provide 4000-5000 protein identifications within a couple of hours of instrumentation time, we advocate here that the higher number of identified proteins does not always translate into better quantitation quality of the proteome analysis. To further elaborate on this issue, we performed a one-on-one comparison of quantitation results obtained using DirectMS1 with three popular MS/MS-based quantitation methods: label-free (LFQ) and tandem mass tag quantitation (TMT), both based on data-dependent acquisition (DDA) and data-independent acquisition (DIA). For comparison, we performed a series of proteome-wide analyses of well-characterized (ground truth) and biologically relevant samples, including a mix of UPS1 proteins spiked at different concentrations into an Echerichia coli digest used as a background and a set of glioblastoma cell lines. MS1-only data was analyzed using a novel quantitation workflow called DirectMS1Quant developed in this work. The results obtained in this study demonstrated comparable quantitation efficiency of 5 min DirectMS1 with both TMT and DIA methods, yet the latter two utilized a 10-20-fold longer instrumentation time.
Collapse
Affiliation(s)
- Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Irina A Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Lev I Levitsky
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Elizaveta M Solovyeva
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Anastasiya V Lipatova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
| | - Frank Kjeldsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center of Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| |
Collapse
|
26
|
Abstract
Metaproteomics is used to explore the functional dynamics of microbial communities. However, acquiring metaproteomic data by tandem mass spectrometry (MS/MS) is time-consuming and resource-intensive, and there is a demand for computational methods that can be used to reduce these resource requirements. We present MetaProClust-MS1, a computational framework for microbiome feature screening developed to prioritize samples for follow-up MS/MS. In this proof-of-concept study, we tested and compared MetaProClust-MS1 results on gut microbiome data, from fecal samples, acquired using short 15-min MS1-only chromatographic gradients and MS1 spectra from longer 60-min gradients to MS/MS-acquired data. We found that MetaProClust-MS1 identified robust gut microbiome responses caused by xenobiotics with significantly correlated cluster topologies of comparable data sets. We also used MetaProClust-MS1 to reanalyze data from both a clinical MS/MS diagnostic study of pediatric patients with inflammatory bowel disease and an experiment evaluating the therapeutic effects of a small molecule on the brain tissue of Alzheimer's disease mouse models. MetaProClust-MS1 clusters could distinguish between inflammatory bowel disease diagnoses (ulcerative colitis and Crohn's disease) using samples from mucosal luminal interface samples and identified hippocampal proteome shifts of Alzheimer's disease mouse models after small-molecule treatment. Therefore, we demonstrate that MetaProClust-MS1 can screen both microbiomes and single-species proteomes using only MS1 profiles, and our results suggest that this approach may be generalizable to any proteomics experiment. MetaProClust-MS1 may be especially useful for large-scale metaproteomic screening for the prioritization of samples for further metaproteomic characterization, using MS/MS, for instance, in addition to being a promising novel approach for clinical diagnostic screening. IMPORTANCE Growing evidence suggests that human gut microbiome composition and function are highly associated with health and disease. As such, high-throughput metaproteomic studies are becoming more common in gut microbiome research. However, using a conventional long liquid chromatography (LC)-MS/MS gradient metaproteomics approach as an initial screen in large-scale microbiome experiments can be slow and expensive. To combat this challenge, we introduce MetaProClust-MS1, a computational framework for microbiome screening using MS1-only profiles. In this proof-of-concept study, we show that MetaProClust-MS1 identifies clusters of gut microbiome treatments using MS1-only profiles similar to those identified using MS/MS. Our approach allows researchers to prioritize samples and treatments of interest for further metaproteomic analyses and may be generally applicable to any proteomic analysis. In particular, this approach may be especially useful for large-scale metaproteomic screening or in clinical settings where rapid diagnostic evidence is required.
Collapse
|
27
|
Zaikin VG, Borisov RS. Mass Spectrometry as a Crucial Analytical Basis for Omics Sciences. JOURNAL OF ANALYTICAL CHEMISTRY 2021. [PMCID: PMC8693159 DOI: 10.1134/s1061934821140094] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
This review is devoted to the consideration of mass spectrometric platforms as applied to omics sciences. The most significant attention is paid to omics related to life sciences (genomics, proteomics, meta-bolomics, lipidomics, glycomics, plantomics, etc.). Mass spectrometric approaches to solving the problems of petroleomics, polymeromics, foodomics, humeomics, and exosomics, related to inorganic sciences, are also discussed. The review comparatively presents the advantages of various principles of separation and mass spectral techniques, complementary derivatization, used to obtain large arrays of various structural and quantitative information in the mentioned omics sciences.
Collapse
Affiliation(s)
- V. G. Zaikin
- Topchiev Institute of Petrochemical Synthesis, Russian Academy of Sciences, 119991 Moscow, Russia
| | - R. S. Borisov
- Topchiev Institute of Petrochemical Synthesis, Russian Academy of Sciences, 119991 Moscow, Russia
- RUDN University, 117198 Moscow, Russia
- Core Facility Center “Arktika,” Northern (Arctic) Federal University, 163002 Arkhangelsk, Russia
| |
Collapse
|
28
|
Hruska M, Holub D. Evaluation of an integrative Bayesian peptide detection approach on a combinatorial peptide library. EUROPEAN JOURNAL OF MASS SPECTROMETRY (CHICHESTER, ENGLAND) 2021; 27:217-234. [PMID: 34989269 DOI: 10.1177/14690667211066725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Detection of peptides lies at the core of bottom-up proteomics analyses. We examined a Bayesian approach to peptide detection, integrating match-based models (fragments, retention time, isotopic distribution, and precursor mass) and peptide prior probability models under a unified probabilistic framework. To assess the relevance of these models and their various combinations, we employed a complete- and a tail-complete search of a low-precursor-mass synthetic peptide library based on oncogenic KRAS peptides. The fragment match was by far the most informative match-based model, while the retention time match was the only remaining such model with an appreciable impact--increasing correct detections by around 8 %. A peptide prior probability model built from a reference proteome greatly improved the detection over a uniform prior, essentially transforming de novo sequencing into a reference-guided search. The knowledge of a correct sequence tag in advance to peptide-spectrum matching had only a moderate impact on peptide detection unless the tag was long and of high certainty. The approach also derived more precise error rates on the analyzed combinatorial peptide library than those estimated using PeptideProphet and Percolator, showing its potential applicability for the detection of homologous peptides. Although the approach requires further computational developments for routine data analysis, it illustrates the value of peptide prior probabilities and presents a Bayesian approach for their incorporation into peptide detection.
Collapse
Affiliation(s)
- Miroslav Hruska
- Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, 98735Palacky University, Olomouc, Czech Republic
- Department of Computer Science, Faculty of Science, 98735Palacky University, Olomouc, Czech Republic
| | - Dusan Holub
- Institute of Molecular and Translational Medicine, Faculty of Medicine and Dentistry, 98735Palacky University, Olomouc, Czech Republic
| |
Collapse
|
29
|
Orsburn BC. Evaluation of the Sensitivity of Proteomics Methods Using the Absolute Copy Number of Proteins in a Single Cell as a Metric. Proteomes 2021; 9:34. [PMID: 34287363 PMCID: PMC8293326 DOI: 10.3390/proteomes9030034] [Citation(s) in RCA: 145] [Impact Index Per Article: 36.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 07/13/2021] [Accepted: 07/15/2021] [Indexed: 01/24/2023] Open
Abstract
Proteomic technology has improved at a staggering pace in recent years, with even practitioners challenged to keep up with new methods and hardware. The most common metric used for method performance is the number of peptides and proteins identified. While this metric may be helpful for proteomics researchers shopping for new hardware, this is often not the most biologically relevant metric. Biologists often utilize proteomics in the search for protein regulators that are of a lower relative copy number in the cell. In this review, I re-evaluate untargeted proteomics data using a simple graphical representation of the absolute copy number of proteins present in a single cancer cell as a metric. By comparing single-shot proteomics data to the coverage of the most in-depth proteomic analysis of that cell line acquired to date, we can obtain a rapid metric of method performance. Using a simple copy number metric allows visualization of how proteomics has developed in both sensitivity and overall dynamic range when using both relatively long and short acquisition times. To enable reanalysis beyond what is presented here, two available web applications have been developed for single- and multi-experiment comparisons with reference protein copy number data for multiple cell lines and organisms.
Collapse
Affiliation(s)
- Benjamin C Orsburn
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD 21205, USA
| |
Collapse
|
30
|
Ivanov MV, Solovyeva EM, Bubis JA, Gorshkov MV. Improving the Protein Inference from Bottom-Up Proteomic Data Using Identifications from MS1 Spectra. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:1258-1262. [PMID: 33900766 DOI: 10.1021/jasms.1c00061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Protein inference is one of the crucial steps in proteome characterization using a bottom-up approach. Multiple algorithms to solve the problem are focused on extensive analysis of shared peptides identified from fragmentation mass spectra (MS/MS). However, many protein homologues with a similar amino acid sequence typically have identical lists of identified peptides due to the problem of proteome undersampling in a bottom-up approach and, thus, cannot be distinguished by existing protein inference methods. Here, we propose the use of peptide feature information extracted from precursor mass spectra to assist in identification of proteins otherwise indistinguishable from MS/MS. The proposed method was integrated with a protein inference algorithm based on the parsimony principle and built-in in the postsearch utility Scavager. The results demonstrate increasing accuracy and efficiency of homologous protein identifications for the well characterized data sets including the one with known protein sequences from iPRG-2016 study.
Collapse
Affiliation(s)
- Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Building 2, Moscow 119334, Russia
| | - Elizaveta M Solovyeva
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Building 2, Moscow 119334, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Building 2, Moscow 119334, Russia
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Building 2, Moscow 119334, Russia
| |
Collapse
|
31
|
Ivanov MV, Bubis JA, Gorshkov V, Abdrakhimov DA, Kjeldsen F, Gorshkov MV. Boosting MS1-only Proteomics with Machine Learning Allows 2000 Protein Identifications in Single-Shot Human Proteome Analysis Using 5 min HPLC Gradient. J Proteome Res 2021; 20:1864-1873. [PMID: 33720732 DOI: 10.1021/acs.jproteome.0c00863] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Proteome-wide analyses rely on tandem mass spectrometry and the extensive separation of proteolytic mixtures. This imposes considerable instrumental time consumption, which is one of the main obstacles in the broader acceptance of proteomics in biomedical and clinical research. Recently, we presented a fast proteomic method termed DirectMS1 based on ultrashort LC gradients as well as MS1-only mass spectra acquisition and data processing. The method allows significant reduction of the proteome-wide analysis time to a few minutes at the depth of quantitative proteome coverage of 1000 proteins at 1% false discovery rate (FDR). In this work, to further increase the capabilities of the DirectMS1 method, we explored the opportunities presented by the recent progress in the machine-learning area and applied the LightGBM decision tree boosting algorithm to the scoring of peptide feature matches when processing MS1 spectra. Furthermore, we integrated the peptide feature identification algorithm of DirectMS1 with the recently introduced peptide retention time prediction utility, DeepLC. Additional approaches to improve the performance of the DirectMS1 method are discussed and demonstrated, such as using FAIMS for gas-phase ion separation. As a result of all improvements to DirectMS1, we succeeded in identifying more than 2000 proteins at 1% FDR from the HeLa cell line in a 5 min gradient LC-FAIMS/MS1 analysis. The data sets generated and analyzed during the current study have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the data set identifier PXD023977.
Collapse
Affiliation(s)
- Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| | - Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Daniil A Abdrakhimov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia.,Moscow Institute of Physics and Technology, Institutsky lane 9, Dolgoprudny, Moscow Region 141700, Russia
| | - Frank Kjeldsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| |
Collapse
|
32
|
Abdrakhimov DA, Bubis JA, Gorshkov V, Kjeldsen F, Gorshkov MV, Ivanov MV. Biosaur: An open-source Python software for liquid chromatography-mass spectrometry peptide feature detection with ion mobility support. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2021:e9045. [PMID: 33450063 DOI: 10.1002/rcm.9045] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 12/20/2020] [Accepted: 01/04/2021] [Indexed: 06/12/2023]
Abstract
RATIONALE One of the important steps in initial data processing of peptide mass spectra is the detection of peptide features in full-range mass spectra. Ion mobility offers advantages over previous methods performing this detection by providing an additional structure-specific separation dimension. However, there is a lack of open-source software that utilizes these advantages and detects peptide features in mass spectra acquired along with ion mobility data using new instruments such as timsTOF and/or FAIMS-Orbitrap. METHODS Recently, a utility called Dinosaur was presented, which provides an efficient way for feature detection in peptide ion mass spectra. In this work we extended its functionality by developing Biosaur software to fully employ the additional information provided by ion mobility data. Biosaur was developed using the Python 3.8 programming language. RESULTS Biosaur supports the processing of data acquired using mass spectrometers with ion mobility capabilities, specifically timsTOF and FAIMS. In addition, it processes mass spectra obtained in negative ion mode and reports cosine correlation table for peptide features which is useful for differentiation between in-source fragments and semi-tryptic peptides. CONCLUSIONS Biosaur is a utility for detecting peptide features in liquid chromatography-mass spectra with ion mobility and negative ion supports. The software is distributed with an open-source APACHE 2.0 license and is freely available on Github: https://github.com/abdrakhimov1/Biosaur.
Collapse
Affiliation(s)
- Daniil A Abdrakhimov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow, 119334, Russia
- Moscow Institute of Physics and Technology, National Research University, G. Dolgoprudny, Institutsky Lane 9, Dolgoprudnyj, RU, 141701, Russia
| | - Julia A Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow, 119334, Russia
| | - Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, DK-5230, Denmark
| | - Frank Kjeldsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, DK-5230, Denmark
| | - Mikhail V Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow, 119334, Russia
| | - Mark V Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow, 119334, Russia
| |
Collapse
|
33
|
Quantitative shotgun proteome analysis by direct infusion. Nat Methods 2020; 17:1222-1228. [PMID: 33230323 PMCID: PMC8009190 DOI: 10.1038/s41592-020-00999-z] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 10/21/2020] [Indexed: 11/15/2022]
Abstract
Liquid chromatography mass spectrometry (LC-MS) delivers sensitive peptide analysis for proteomics, but the methodology requires extensive analysis time, hampering throughput. Here, we demonstrate that using gas-phase peptide separation instead of LC enables fast proteome analysis. Using Direct Infusion – Shotgun Proteome Analysis (DI-SPA) by data-independent acquisition mass spectrometry (DIA-MS), we demonstrate the targeted quantification of over 500 proteins within minutes of MS data collection (~3.5 proteins/second). We show the utility of this technology to perform a complex multifactorial proteome study of interactions between nutrients, genotype, and mitochondrial toxins in a collection of cultured human cells. More than 45,000 quantitative protein measurements from 132 samples were achieved in only 4.4 hours of MS data collection. Enabling fast, unbiased proteome quantification without LC, DI-SPA offers an approach to boosting throughput critical to drug and biomarker discovery studies that require analysis of thousands of proteomes.
Collapse
|
34
|
Möginger U, Marcussen N, Jensen ON. Histo-molecular differentiation of renal cancer subtypes by mass spectrometry imaging and rapid proteome profiling of formalin-fixed paraffin-embedded tumor tissue sections. Oncotarget 2020; 11:3998-4015. [PMID: 33216824 PMCID: PMC7646834 DOI: 10.18632/oncotarget.27787] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Accepted: 10/10/2020] [Indexed: 12/24/2022] Open
Abstract
Pathology differentiation of renal cancer types is challenging due to tissue similarities or overlapping histological features of various tumor (sub) types. As assessment is often manually conducted outcomes can be prone to human error and therefore require high-level expertise and experience. Mass spectrometry can provide detailed histo-molecular information on tissue and is becoming increasingly popular in clinical settings. Spatially resolving technologies such as mass spectrometry imaging and quantitative microproteomics profiling in combination with machine learning approaches provide promising tools for automated tumor classification of clinical tissue sections. In this proof of concept study we used MALDI-MS imaging (MSI) and rapid LC-MS/MS-based microproteomics technologies (15 min/sample) to analyze formalin-fixed paraffin embedded (FFPE) tissue sections and classify renal oncocytoma (RO, n = 11), clear cell renal cell carcinoma (ccRCC, n = 12) and chromophobe renal cell carcinoma (ChRCC, n = 5). Both methods were able to distinguish ccRCC, RO and ChRCC in cross-validation experiments. MSI correctly classified 87% of the patients whereas the rapid LC-MS/MS-based microproteomics approach correctly classified 100% of the patients. This strategy involving MSI and rapid proteome profiling by LC-MS/MS reveals molecular features of tumor sections and enables cancer subtype classification. Mass spectrometry provides a promising complementary approach to current pathological technologies for precise digitized diagnosis of diseases.
Collapse
Affiliation(s)
- Uwe Möginger
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Odense, Denmark.,Present address: Global Research Technologies, Novo Nordisk A/S, Novo Nordisk Park, Bagsværd, Denmark
| | - Niels Marcussen
- Institute for Pathology, Odense University Hospital, Odense, Denmark
| | - Ole N Jensen
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
35
|
Mun DG, Renuse S, Saraswat M, Madugundu A, Udainiya S, Kim H, Park SKR, Zhao H, Nirujogi RS, Na CH, Kannan N, Yates JR, Lee SW, Pandey A. PASS-DIA: A Data-Independent Acquisition Approach for Discovery Studies. Anal Chem 2020; 92:14466-14475. [PMID: 33079518 DOI: 10.1021/acs.analchem.0c02513] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
A data-independent acquisition (DIA) approach is being increasingly adopted as a promising strategy for identification and quantitation of proteomes. As most DIA data sets are acquired with wide isolation windows, highly complex MS/MS spectra are generated, which negatively impacts obtaining peptide information through classical protein database searches. Therefore, the analysis of DIA data mainly relies on the evidence of the existence of peptides from prebuilt spectral libraries. Consequently, one major weakness of this method is that it does not account for peptides that are not included in the spectral library, precluding the use of DIA for discovery studies. Here, we present a strategy termed Precursor ion And Small Slice-DIA (PASS-DIA) in which MS/MS spectra are acquired with small isolation windows (slices) and MS/MS spectra are interpreted with accurately determined precursor ion masses. This method enables the direct application of conventional spectrum-centric analysis pipelines for peptide identification and precursor ion-based quantitation. The performance of PASS-DIA was observed to be superior to both data-dependent acquisition (DDA) and conventional DIA experiments with 69 and 48% additional protein identifications, respectively. Application of PASS-DIA for the analysis of post-translationally modified peptides again highlighted its superior performance in characterizing phosphopeptides (77% more), N-terminal acetylated peptides (56% more), and N-glycopeptides (83% more) as compared to DDA alone. Finally, the use of PASS-DIA to characterize a rare proteome of human fallopian tube organoids enabled 34% additional protein identifications than DDA alone and revealed biologically relevant pathways including low abundance proteins. Overall, PASS-DIA is a novel DIA approach for use as a discovery tool that outperforms both conventional DDA and DIA experiments to provide additional protein information. We believe that the PASS-DIA method is an important strategy for discovery-type studies when deeper proteome characterization is required.
Collapse
Affiliation(s)
- Dong-Gi Mun
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States
| | - Santosh Renuse
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota 55905, United States
| | - Mayank Saraswat
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Molecular Medicine, National Institute of Mental Health and Neurosciences (NIMHANS), Hosur Road, Bangalore 560029, India.,Institute of Bioinformatics, International Technology Park, Bangalore 560066, Karnataka, India.,Manipal Academy of Higher Education (MAHE), Manipal 576104 Karnataka, India
| | - Anil Madugundu
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Molecular Medicine, National Institute of Mental Health and Neurosciences (NIMHANS), Hosur Road, Bangalore 560029, India.,Institute of Bioinformatics, International Technology Park, Bangalore 560066, Karnataka, India.,Manipal Academy of Higher Education (MAHE), Manipal 576104 Karnataka, India
| | - Savita Udainiya
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Molecular Medicine, National Institute of Mental Health and Neurosciences (NIMHANS), Hosur Road, Bangalore 560029, India
| | - Hokeun Kim
- Department of Chemistry, Center for Proteogenome Research, Korea University, Seoul 136-701, Republic of Korea
| | - Sung-Kyu Robin Park
- Department of Molecular Medicine and Neurobiology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Hui Zhao
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States
| | - Raja Sekhar Nirujogi
- Medical Research Council Protein Phosphorylation and Ubiquitylation Unit, University of Dundee, Dundee DD1 5EH, United Kingdom
| | - Chan Hyun Na
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, United States.,Neurology, Institute for Cell Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, United States
| | - Nagarajan Kannan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States
| | - John R Yates
- Department of Molecular Medicine and Neurobiology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Sang-Won Lee
- Department of Chemistry, Center for Proteogenome Research, Korea University, Seoul 136-701, Republic of Korea
| | - Akhilesh Pandey
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Individualized Medicine, Mayo Clinic, Rochester, Minnesota 55905, United States.,Center for Molecular Medicine, National Institute of Mental Health and Neurosciences (NIMHANS), Hosur Road, Bangalore 560029, India.,Institute of Bioinformatics, International Technology Park, Bangalore 560066, Karnataka, India.,Manipal Academy of Higher Education (MAHE), Manipal 576104 Karnataka, India
| |
Collapse
|