1
|
Wang J, Tan H, Fu Y, Mishra A, Sun H, Wang Z, Wu Z, Wang X, Serrano GE, Beach TG, Peng J, High AA. Evaluation of Protein Identification and Quantification by the diaPASEF Method on timsTOF SCP. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1253-1260. [PMID: 38754071 DOI: 10.1021/jasms.4c00067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
Accurate and precise quantification is crucial in modern proteomics, particularly in the context of exploring low-amount samples. While the innovative 4D-data-independent acquisition (DIA) quantitative proteomics facilitated by timsTOF mass spectrometers gives enhanced sensitivity and selectivity for protein identification, the diaPASEF (parallel accumulation-serial fragmentation combined with data-independent acquisition) parameters have not been systematically optimized, and a comprehensive evaluation of the quantification is currently lacking. In this study, we conducted a thorough optimization of key parameters on a timsTOF SCP instrument, including sample loading amount (50 ng), ramp/accumulation time (140 ms), isolation window width (20 m/z), and gradient time (60 min). To further improve the identification of proteins in low-amount samples, we utilized different column settings and introduced 0.02% n-dodecyl-β-d-maltoside (DDM) in the sample reconstitution solution, resulting in a remarkable 19-fold increase in protein identification at the single-cell-equivalent level. Moreover, a comprehensive comparison of protein quantification using a tandem mass tag reporter (TMT-reporter), complement TMT ions (TMTc), and diaPASEF revealed a strong correlation between these methods. Both diaPASEF and TMTc have effectively addressed the issue of ratio compression, highlighting the diaPASEF method's effectiveness in achieving accurate quantification data compared to TMT reporter quantification. Additionally, an in-depth analysis of in-group variation positioned diaPASEF between the TMT-reporter and TMTc methods. Therefore, diaPASEF quantification on the timsTOF SCP instrument emerges as a precise and accurate methodology for quantitative proteomics, especially for samples with small amounts.
Collapse
Affiliation(s)
- Ju Wang
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Haiyan Tan
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Yingxue Fu
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Ashutosh Mishra
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Huan Sun
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Zhen Wang
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Zhiping Wu
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Xusheng Wang
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Geidy E Serrano
- Banner Sun Health Research Institute, Sun City, Arizona 85351, United States
| | - Thomas G Beach
- Banner Sun Health Research Institute, Sun City, Arizona 85351, United States
| | - Junmin Peng
- Departments of Structural Biology and Developmental Neurobiology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Anthony A High
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| |
Collapse
|
2
|
Stincone P, Naimi A, Saviola AJ, Reher R, Petras D. Decoding the molecular interplay in the central dogma: An overview of mass spectrometry-based methods to investigate protein-metabolite interactions. Proteomics 2024; 24:e2200533. [PMID: 37929699 DOI: 10.1002/pmic.202200533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 10/15/2023] [Accepted: 10/23/2023] [Indexed: 11/07/2023]
Abstract
With the emergence of next-generation nucleotide sequencing and mass spectrometry-based proteomics and metabolomics tools, we have comprehensive and scalable methods to analyze the genes, transcripts, proteins, and metabolites of a multitude of biological systems. Despite the fascinating new molecular insights at the genome, transcriptome, proteome and metabolome scale, we are still far from fully understanding cellular organization, cell cycles and biology at the molecular level. Significant advances in sensitivity and depth for both sequencing as well as mass spectrometry-based methods allow the analysis at the single cell and single molecule level. At the same time, new tools are emerging that enable the investigation of molecular interactions throughout the central dogma of molecular biology. In this review, we provide an overview of established and recently developed mass spectrometry-based tools to probe metabolite-protein interactions-from individual interaction pairs to interactions at the proteome-metabolome scale.
Collapse
Affiliation(s)
- Paolo Stincone
- University of Tuebingen, CMFI Cluster of Excellence, Interfaculty Institute of Microbiology and Infection Medicine, Tuebingen, Germany
- University of Tuebingen, Center for Plant Molecular Biology, Tuebingen, Germany
| | - Amira Naimi
- University of Marburg, Institute of Pharmaceutical Biology and Biotechnology, Marburg, Germany
| | | | - Raphael Reher
- University of Marburg, Institute of Pharmaceutical Biology and Biotechnology, Marburg, Germany
| | - Daniel Petras
- University of Tuebingen, CMFI Cluster of Excellence, Interfaculty Institute of Microbiology and Infection Medicine, Tuebingen, Germany
- University of California Riverside, Department of Biochemistry, Riverside, USA
| |
Collapse
|
3
|
Peng H, Wang H, Kong W, Li J, Goh WWB. Optimizing differential expression analysis for proteomics data via high-performing rules and ensemble inference. Nat Commun 2024; 15:3922. [PMID: 38724498 PMCID: PMC11082229 DOI: 10.1038/s41467-024-47899-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 04/16/2024] [Indexed: 05/12/2024] Open
Abstract
Identification of differentially expressed proteins in a proteomics workflow typically encompasses five key steps: raw data quantification, expression matrix construction, matrix normalization, missing value imputation (MVI), and differential expression analysis. The plethora of options in each step makes it challenging to identify optimal workflows that maximize the identification of differentially expressed proteins. To identify optimal workflows and their common properties, we conduct an extensive study involving 34,576 combinatoric experiments on 24 gold standard spike-in datasets. Applying frequent pattern mining techniques to top-ranked workflows, we uncover high-performing rules that demonstrate optimality has conserved properties. Via machine learning, we confirm optimal workflows are indeed predictable, with average cross-validation F1 scores and Matthew's correlation coefficients surpassing 0.84. We introduce an ensemble inference to integrate results from individual top-performing workflows for expanding differential proteome coverage and resolve inconsistencies. Ensemble inference provides gains in pAUC (up to 4.61%) and G-mean (up to 11.14%) and facilitates effective aggregation of information across varied quantification approaches such as topN, directLFQ, MaxLFQ intensities, and spectral counts. However, further development and evaluation are needed to establish acceptable frameworks for conducting ensemble inference on multiple proteomics workflows.
Collapse
Affiliation(s)
- Hui Peng
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - He Wang
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Weijia Kong
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Jinyan Li
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
| | - Wilson Wen Bin Goh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore.
- Center of AI in Medicine, Nanyang Technological University, Singapore, Singapore.
- Division of Neurology, Department of Brain Sciences, Faculty of Medicine, Imperial College London, London, UK.
| |
Collapse
|
4
|
Truong T, Kelly RT. What's new in single-cell proteomics. Curr Opin Biotechnol 2024; 86:103077. [PMID: 38359605 PMCID: PMC11068367 DOI: 10.1016/j.copbio.2024.103077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 01/19/2024] [Indexed: 02/17/2024]
Abstract
In recent years, single-cell proteomics (SCP) has advanced significantly, enabling the analysis of thousands of proteins within single mammalian cells. This progress is driven by advances in experimental design, with maturing label-free and multiplexed methods, optimized sample preparation, and innovations in separation techniques, including ultra-low-flow nanoLC. These factors collectively contribute to improved sensitivity, throughput, and reproducibility. Cutting-edge mass spectrometry platforms and data acquisition approaches continue to play a critical role in enhancing data quality. Furthermore, the exploration of spatial proteomics with single-cell resolution offers significant promise for understanding cellular interactions, giving rise to various phenotypes. SCP has far-reaching applications in cancer research, biomarker discovery, and developmental biology. Here, we provide a critical review of recent advances in the field of SCP.
Collapse
Affiliation(s)
- Thy Truong
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT 84602, United States
| | - Ryan T Kelly
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT 84602, United States.
| |
Collapse
|
5
|
Wu Z, Huang X, Huang L, Zhang X. 102-Plex Approach for Accurate and Multiplexed Proteome Quantification. Anal Chem 2024; 96:1402-1409. [PMID: 38215345 DOI: 10.1021/acs.analchem.3c03036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2024]
Abstract
Hyperplexing approaches have been aimed to meet the demand for large-scale proteomic analyses. Currently, the analysis capacity has expanded to up to 54 samples within a single experiment by utilizing different isotopic and isobaric reagent combinations. In this report, we propose a super multiplexed approach to enable the analysis of up to 102 samples in a single experiment, by the combination of our recently developed TAG-TMTpro and TAG-IBT16 labeling. We systematically investigated the identification and quantification performance of the 102-plex approach using the mixtures of E. coli and HeLa peptides. Our results revealed that all labeling series demonstrated accurate and reliable quantification performance. The combination of TAG-IBT16 and TAG-TMTpro approaches expands the multiplexing capacity to 102 plexes, providing a more multiplexed quantification method for even larger-scale proteomic analysis. Data are available via ProteomeXchange with the identifier PXD042398.
Collapse
Affiliation(s)
- Zhen Wu
- State Key Laboratory of Genetic Engineering, Department of Biochemistry and Biophysics, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Xirui Huang
- State Key Laboratory of Genetic Engineering, Department of Biochemistry and Biophysics, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Lin Huang
- State Key Laboratory of Genetic Engineering, Department of Biochemistry and Biophysics, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Xumin Zhang
- State Key Laboratory of Genetic Engineering, Department of Biochemistry and Biophysics, School of Life Sciences, Fudan University, Shanghai 200438, China
| |
Collapse
|
6
|
Zheng R, Matzinger M, Mayer RL, Valenta A, Sun X, Mechtler K. A High-Sensitivity Low-Nanoflow LC-MS Configuration for High-Throughput Sample-Limited Proteomics. Anal Chem 2023; 95:18673-18678. [PMID: 38088903 PMCID: PMC10753523 DOI: 10.1021/acs.analchem.3c03058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 11/16/2023] [Accepted: 11/21/2023] [Indexed: 12/27/2023]
Abstract
This work demonstrates the utility of high-throughput nanoLC-MS and label-free quantification (LFQ) for sample-limited bottom-up proteomics analysis, including single-cell proteomics (SCP). Conditions were optimized on a 50 μm internal diameter (I.D.) column operated at 100 nL/min in the direct injection workflow to balance method sensitivity and sample throughput from 24 to 72 samples/day. Multiple data acquisition strategies were also evaluated for proteome coverage, including data-dependent acquisition (DDA), wide-window acquisition (WWA), and wide-window data-independent acquisition (WW-DIA). Analyzing 250 pg HeLa digest with a 10-min LC gradient (72 samples/day) provided >900, >1,800, and >3,000 protein group identifications for DDA, WWA, and WW-DIA, respectively. Total method cycle time was further reduced from 20 to 14.4 min (100 samples/day) by employing a trap-and-elute workflow, enabling 70% mass spectrometer utilization. The method was applied to library-free DIA analysis of single-cell samples, yielding >1,700 protein groups identified. In conclusion, this study provides a high-sensitivity, high-throughput nanoLC-MS configuration for sample-limited proteomics.
Collapse
Affiliation(s)
- Runsheng Zheng
- Thermo
Fisher Scientific, Dornier Str. 4, 82110 Germering, Germany
| | - Manuel Matzinger
- IMP—Institute
of Molecular Pathology, Campus-Vienna-Biocenter 1, A-1030 Vienna, Austria
| | - Rupert L. Mayer
- IMP—Institute
of Molecular Pathology, Campus-Vienna-Biocenter 1, A-1030 Vienna, Austria
| | - Alec Valenta
- Thermo
Fisher Scientific, Dornier Str. 4, 82110 Germering, Germany
| | - Xuefei Sun
- Thermo
Fisher Scientific, 1228 Titan Way, Sunnyvale, California 94085, United States
| | - Karl Mechtler
- IMP—Institute
of Molecular Pathology, Campus-Vienna-Biocenter 1, A-1030 Vienna, Austria
- IMBA—Institute
of Molecular Biotechnology of the Austrian Academy of Sciences, Dr. Bohr Gasse 3, A-1030 Vienna, Austria
- Gregor
Mendel Institute of Molecular Plant Biology of the Austrian Academy
of Sciences, Dr. Bohr
Gasse 3, A-1030 Vienna, Austria
| |
Collapse
|
7
|
Orsburn BC. Metabolomic, Proteomic, and Single-Cell Proteomic Analysis of Cancer Cells Treated with the KRAS G12D Inhibitor MRTX1133. J Proteome Res 2023; 22:3703-3713. [PMID: 37983312 PMCID: PMC10696623 DOI: 10.1021/acs.jproteome.3c00212] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Indexed: 11/22/2023]
Abstract
Mutations in KRAS are common drivers of human cancers and are often those with the poorest overall prognosis for patients. A recently developed compound, MRTX1133, has shown promise in inhibiting the activity of KRASG12D mutant proteins, which is one of the main drivers of pancreatic cancer. To better understand the mechanism of action of this compound, I performed both proteomics and metabolomics on four KRASG12D mutant pancreatic cancer cell lines. To obtain increased granularity in the proteomic observations, single-cell proteomics was successfully performed on two of these lines. Following quality filtering, a total of 1498 single cells were analyzed. From these cells, 3140 total proteins were identified with approximately 953 proteins quantified per cell. At 48 h of treatment, two distinct populations of cells can be observed based on the level of effectiveness of the drug in decreasing the total abundance of the KRAS protein in each respective cell, with results that are effectively masked in the bulk cell analysis. All mass spectrometry data and processed results are publicly available at www.massive.ucsd.edu at accessions PXD039597, PXD039601, and PXD039600.
Collapse
Affiliation(s)
- Benjamin C. Orsburn
- The Department of Pharmacology and
Molecular Sciences, The Johns Hopkins University
School of Medicine, Baltimore, Maryland 21205, United States
| |
Collapse
|
8
|
Orsburn BC. An integrated method for single cell proteomics with simultaneous measurements of intracellular drug concentration implicates new mechanisms for adaptation to KRAS G12D inhibitors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.18.567669. [PMID: 38014353 PMCID: PMC10680798 DOI: 10.1101/2023.11.18.567669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
It is well established that a population of single human cells will often respond to the same drug treatment in a heterogeneous manner. In the context of chemotherapeutics, these diverse responses may lead to individual adaptation mechanisms and ultimately multiple distinct methods of resistance. The obvious question from a pharmacology perspective is how intracellular concentrations of active drug varies between individual cells, and what role does that variation play in drug response heterogeneity? To date, no integrated methods for rapidly measuring intracellular drug levels while simultaneously measuring drug responses have been described. This study describes a method for single cell preparation that allows proteins to be extracted and digested from single cells while maintaining conditions for small molecules to be simultaneously measured. The method as described allows up to 40 cells to be analyzed per instrument per day. When applied to a KRASG12D small molecule inhibitor I observe a wide degree of intracellular levels of the drug, and that proteomic responses largely stratify based on the concentration of drug within each single cell. Further work is in progress to develop and standardize this method and - more importantly - to normalize drug measurements against direct measurements of cell volume. However, these preliminary results appear promising for the identification of single cells with unique drug response mechanisms. All data described in this study has been made publicly available through the ProteomeXchange consortium under accession PXD046002.
Collapse
Affiliation(s)
- Benjamin C. Orsburn
- The Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, MD, USA, 21205
| |
Collapse
|
9
|
Lazear MR. Sage: An Open-Source Tool for Fast Proteomics Searching and Quantification at Scale. J Proteome Res 2023; 22:3652-3659. [PMID: 37819886 DOI: 10.1021/acs.jproteome.3c00486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
The growing complexity and volume of proteomics data necessitate the development of efficient software tools for peptide identification and quantification from mass spectra. Given their central role in proteomics, it is imperative that these tools are auditable and extensible─requirements that are best fulfilled by open-source and permissively licensed software. This work presents Sage, a high-performance, open-source, and freely available proteomics pipeline. Scalable and cloud-ready, Sage matches the performance of state-of-the-art software tools while running an order of magnitude faster.
Collapse
Affiliation(s)
- Michael R Lazear
- Belharra Therapeutics, 3985 Sorrento Valley Boulevard Suite C, San Diego, California 92121, United States
| |
Collapse
|
10
|
Orsburn BC. Metabolomic, proteomic and single cell proteomic analysis of cancer cells treated with the KRAS G12D inhibitor MRTX1133. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.23.533981. [PMID: 36993160 PMCID: PMC10055375 DOI: 10.1101/2023.03.23.533981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Mutations in KRAS are common drivers of human cancers and are often those with the poorest overall prognosis for patients. A recently developed compound, MRTX1133, has shown promise in inhibiting the activity of KRASG12D mutant proteins, one of the main drivers in pancreatic cancer. To better understand the mechanism of action of this compound I performed both proteomics and metabolomics on four KRASG12D mutant pancreatic cancer cell lines. To obtain increased granularity in the proteomic observations, single cell proteomics was successfully performed on two of these lines. Following quality filtering, a total of 1,498 single cells were analyzed. From these cells 3,140 total proteins were identified with approximately 953 proteins quantified per cell. At 48 hours of treatment, two distinct populations of cells can be observed based on the level of effectiveness of the drug in decreasing total abundance of the KRAS protein in each respective cell, results that are effectively masked in the bulk cell analysis. All mass spectrometry data and processed results are publicly available at the www.massive.ucsd.edu at accessions PXD039597, PXD039601 and PXD039600.
Collapse
Affiliation(s)
- Benjamin C. Orsburn
- The Department of Pharmacology and Molecular Sciences, The Johns Hopkins University School of Medicine, Baltimore, MD, USA, 21205
| |
Collapse
|
11
|
Warshanna A, Orsburn BC. SCP Viz - A universal graphical user interface for single protein analysis in single cell proteomics datasets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.29.555397. [PMID: 37693496 PMCID: PMC10491148 DOI: 10.1101/2023.08.29.555397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Single cell proteomics (SCP) requires the analysis of dozens to thousands of single human cells to draw biological conclusions. However, assessing of the abundance of single proteins in output data presents a considerable challenge, and no simple universal solutions currently exist. To address this, we developed SCP Viz, a statistical package with a graphical user interface that can handle small and large scale SCP output from any instrument or data processing software. In this software, the abundance of individual proteins can be plotted in a variety of ways, using either unadjusted or normalized outputs. These outputs can also be transformed or imputed within the software. SCP Viz offers a variety of plotting options which can help identify significantly altered proteins between groups, both before and after quantitative transformations. Upon the discovery of subpopulations of single cells, users can easily regroup the cells of interest using straightforward text-based filters. When used in this way, SCP Viz allows users to visualize proteomic heterogeneity at the level of individual proteins, cells, or identified subcellular populations. SCP Viz is compatible with output files from MaxQuant, FragPipe, SpectroNaut, and Proteome Discoverer, and should work equally well with other formats. SCP Viz is publicly available at https://github.com/orsburn/SCPViz. For demonstrations, users can download our test data from GitHub and use an online version that accepts user input for analysis at https://orsburnlab.shinyapps.io/SCPViz/.
Collapse
Affiliation(s)
- Ahmed Warshanna
- The Department of Pharmacology and Molecular Sciences; The Johns Hopkins University School of Medicine, Baltimore, MD, USA, 21205
| | - Benjamin C. Orsburn
- The Department of Pharmacology and Molecular Sciences; The Johns Hopkins University School of Medicine, Baltimore, MD, USA, 21205
| |
Collapse
|
12
|
Akintayo SO, Hosseini B, Vahidinasab M, Messmer M, Pfannstiel J, Bertsche U, Hubel P, Henkel M, Hausmann R, Voegele RT, Lilge L. Characterization ofantifungal properties of lipopeptide-producing Bacillus velezensis strains and their proteome-based response to the phytopathogens, Diaporthe spp. Front Bioeng Biotechnol 2023; 11:1228386. [PMID: 37609113 PMCID: PMC10440741 DOI: 10.3389/fbioe.2023.1228386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 07/28/2023] [Indexed: 08/24/2023] Open
Abstract
Introduction: B. velezensis strains are of interest in agricultural applications due to their beneficial interactions with plants, notable through their antimicrobial activity. The biocontrol ability of two new lipopeptides-producing B. velezensis strains ES1-02 and EFSO2-04, against fungal phytopathogens of Diaporthe spp., was evaluated and compared with reference strains QST713 and FZB42. All strains were found to be effective against the plant pathogens, with the new strains showing comparable antifungal activity to QST713 and slightly lower activity than FZB42. Methods: Lipopeptides and their isoforms were identified by high-performance thin-layer chromatography (HPTLC) and mass spectrometric measurements. The associated antifungal influences were determined in direct in vitro antagonistic dual culture assays, and the inhibitory growth effects on Diaporthe spp. as representatives of phytopathogenic fungi were determined. The effects on bacterial physiology of selected B. velezensis strains were analyzed by mass spectrometric proteomic analyses using nano-LC-MS/MS. Results and Discussion: Lipopeptide production analysis revealed that all strains produced surfactin, and one lipopeptide of the iturin family, including bacillomycin L by ES1-02 and EFSO2-04, while QST713 and FZB42 produced iturin A and bacillomycin D, respectively. Fengycin production was however only detected in the reference strains. As a result of co-incubation of strain ES1-02 with the antagonistic phytopathogen D. longicolla, an increase in surfactin production of up to 10-fold was observed, making stress induction due to competitors an attractive strategy for surfactin bioproduction. An associated global proteome analysis showed a more detailed overview about the adaptation and response mechanisms of B. velezensis, including an increased abundance of proteins associated with the biosynthesis of antimicrobial compounds. Furthermore, higher abundance was determined for proteins associated with oxidative, nitrosative, and general stress response. In contrast, proteins involved in phosphate uptake, amino acid transport, and translation were decreased in abundance. Altogether, this study provides new insights into the physiological adaptation of lipopeptide-producing B. velezensis strains, which show the potential for use as biocontrol agents with respect to phytopathogenic fungi.
Collapse
Affiliation(s)
- Stephen Olusanmi Akintayo
- Department of Bioprocess Engineering, Institute of Food Science and Biotechnology, University of Hohenheim, Stuttgart, Germany
| | - Behnoush Hosseini
- Department of Phytopathology, Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Maliheh Vahidinasab
- Department of Bioprocess Engineering, Institute of Food Science and Biotechnology, University of Hohenheim, Stuttgart, Germany
| | - Marc Messmer
- Department of Bioprocess Engineering, Institute of Food Science and Biotechnology, University of Hohenheim, Stuttgart, Germany
| | - Jens Pfannstiel
- Core Facility Hohenheim, Mass Spectrometry Core Facility, University of Hohenheim, Stuttgart, Germany
| | - Ute Bertsche
- Core Facility Hohenheim, Mass Spectrometry Core Facility, University of Hohenheim, Stuttgart, Germany
| | - Philipp Hubel
- Core Facility Hohenheim, Mass Spectrometry Core Facility, University of Hohenheim, Stuttgart, Germany
| | - Marius Henkel
- Cellular Agriculture, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Rudolf Hausmann
- Department of Bioprocess Engineering, Institute of Food Science and Biotechnology, University of Hohenheim, Stuttgart, Germany
| | - Ralf T. Voegele
- Department of Phytopathology, Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Lars Lilge
- Department of Bioprocess Engineering, Institute of Food Science and Biotechnology, University of Hohenheim, Stuttgart, Germany
- Department of Molecular Genetics, University of Groningen, Groningen, Netherlands
| |
Collapse
|
13
|
Orsburn BC. Acetic acid is a superior acidifier for sub-nanogram and single cell proteomic studies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.01.551522. [PMID: 37577694 PMCID: PMC10418182 DOI: 10.1101/2023.08.01.551522] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
A recent study demonstrated a substantial signal increase when employing a 0.5% acetic acid buffer additive instead of the traditional 0.1% formic acid used in shotgun proteomics. In this study I compare these two buffers for a dilution series of tryptic digests down to 20 picograms peptide on column on a TIMSTOF single cell proteome (SCP) system. I observe a comparable relative level of signal increase as previously reported, which translates to improvements in proteome coverage at every peptide load assessed. The relative increase in peptide identifications is more apparent at lower concentrations with a striking 1.8-fold more peptides identified at 20 pg peptide load, resulting in over 2,000 protein groups identified in 30 minutes on this system. These results translate well to isolated single human cancer cells allowing over 1,000 protein groups to be identified in single human cells processed using a simple one step method in standard 96-well plates. All vendor raw and processed data has been made publicly available at www.massive.ucsd.edu and can be accessed as MSV000092563.
Collapse
Affiliation(s)
- Benjamin C. Orsburn
- The Department of Pharmacology and Molecular Sciences, Single Cell Proteomics Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA, 21205
| |
Collapse
|
14
|
Matzinger M, Mayer RL, Mechtler K. Label-free single cell proteomics utilizing ultrafast LC and MS instrumentation: A valuable complementary technique to multiplexing. Proteomics 2023; 23:e2200162. [PMID: 36806919 PMCID: PMC10909491 DOI: 10.1002/pmic.202200162] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/21/2023]
Abstract
The ability to map a proteomic fingerprint to transcriptomic data would master the understanding of how gene expression translates into actual phenotype. In contrast to nucleic acid sequencing, in vitro protein amplification is impossible and no single cell proteomic workflow has been established as gold standard yet. Advances in microfluidic sample preparation, multi-dimensional sample separation, sophisticated data acquisition strategies, and intelligent data analysis algorithms have resulted in major improvements to successfully analyze such tiny sample amounts with steadily boosted performance. However, among the broad variation of published approaches, it is commonly accepted that highest possible sensitivity, robustness, and throughput are still the most urgent needs for the field. While many labs have focused on multiplexing to achieve these goals, label-free SCP is a highly promising strategy as well whenever high dynamic range and unbiased accurate quantification are needed. We here focus on recent advances in label-free single-cell mass spectrometry workflows and try to guide our readers to choose the best method or combinations of methods for their specific applications. We further highlight which techniques are most propitious in the future and which applications but also limitations we foresee for the field.
Collapse
Affiliation(s)
- Manuel Matzinger
- Research Institute of Molecular Pathology (IMP)Vienna BioCenterViennaAustria
| | - Rupert L. Mayer
- Research Institute of Molecular Pathology (IMP)Vienna BioCenterViennaAustria
| | - Karl Mechtler
- Research Institute of Molecular Pathology (IMP)Vienna BioCenterViennaAustria
- Gregor Mendel Institute of Molecular Plant Biology (GMI), Austrian Academy of SciencesVienna BioCenter (VBC)ViennaAustria
- Institute of Molecular Biotechnology (IMBA), Austrian Academy of SciencesVienna BioCenter (VBC)ViennaAustria
| |
Collapse
|
15
|
Zaza G, Neri F, Bruschi M, Granata S, Petretto A, Bartolucci M, di Bella C, Candiano G, Stallone G, Gesualdo L, Furian L. Proteomics reveals specific biological changes induced by the normothermic machine perfusion of donor kidneys with a significant up-regulation of Latexin. Sci Rep 2023; 13:5920. [PMID: 37041202 PMCID: PMC10090051 DOI: 10.1038/s41598-023-33194-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 04/08/2023] [Indexed: 04/13/2023] Open
Abstract
Renal normothermic machine perfusion (NMP) is an organ preservation method based on the circulation of a warm (35-37 °C) perfusion solution through the renal vasculature to deliver oxygen and nutrients. However, its biological effects on marginal kidneys are unclear. We therefore used mass spectrometry to determine the proteomic profile of kidney tissue and urine from eight organs reconditioned for 120 min using a Kidney Assist device. Biopsies were taken during the pre-implantation histological evaluation (T-1), at the start of back table preparation (T0), and after 60 and 120 min of perfusion (T60, T120). Urine samples were collected at T0 (urine produced in the first 15 min after the beginning of normothermic reperfusion), T30, T60 and T120. Multiple algorithms, support vector machine learning and partial least squares discriminant analysis were used to select the most discriminative proteins during NMP. Statistical analysis revealed the upregulation of 169 proteins and the downregulation of 196 during NMP. Machine learning algorithms identified the top 50 most discriminative proteins, five of which were concomitantly upregulated (LXN, ETFB, NUDT3, CYCS and UQCRC1) and six downregulated (CFHR3, C1S, CFI, KNG1, SERPINC1 and F9) in the kidney and urine after NMP. Latexin (LXN), an endogenous carboxypeptidase inhibitor, resulted the most-upregulated protein at T120, and this result was confirmed by ELISA. In addition, functional analysis revealed that the most strongly upregulated proteins were involved in the oxidative phosphorylation system and ATP synthesis, whereas the downregulated proteins represented the complement system and coagulation cascade. Our proteomic analysis demonstrated that even brief periods of NMP induce remarkable metabolic and biochemical changes in marginal organs, which supports the use of this promising technique in the clinic.
Collapse
Affiliation(s)
- Gianluigi Zaza
- Nephrology, Dialysis and Transplantation Unit, Department of Medical and Surgical Sciences, University-Hospital of Foggia, Via L. Pinto 1, 71122, Foggia, Italy.
| | - Flavia Neri
- Kidney and Pancreas Transplantation Unit, University of Padua, Padua, Italy
| | - Maurizio Bruschi
- Laboratory of Molecular Nephrology, IRCCS Istituto Giannina Gaslini, Genoa, Italy
- Department of Experimental Medicine (DIMES), University of Genoa, Genoa, Italy
| | - Simona Granata
- Nephrology, Dialysis and Transplantation Unit, Department of Medical and Surgical Sciences, University-Hospital of Foggia, Via L. Pinto 1, 71122, Foggia, Italy
| | - Andrea Petretto
- Core Facilities - Proteomica E Metabolomica Clinica, IRCCS Istituto Giannina Gaslini, Genova, Italy
| | - Martina Bartolucci
- Core Facilities - Proteomica E Metabolomica Clinica, IRCCS Istituto Giannina Gaslini, Genova, Italy
| | - Caterina di Bella
- Kidney and Pancreas Transplantation Unit, University of Padua, Padua, Italy
| | - Giovanni Candiano
- Laboratory of Molecular Nephrology, IRCCS Istituto Giannina Gaslini, Genoa, Italy
| | - Giovanni Stallone
- Nephrology, Dialysis and Transplantation Unit, Department of Medical and Surgical Sciences, University-Hospital of Foggia, Via L. Pinto 1, 71122, Foggia, Italy
| | - Loreto Gesualdo
- Nephrology, Dialysis and Transplantation Unit, Department of Precision and Regenerative Medicine and Ionian Area (DiMePRe-J), University of Bari "Aldo Moro", Bari, Italy
| | - Lucrezia Furian
- Kidney and Pancreas Transplantation Unit, University of Padua, Padua, Italy
| |
Collapse
|
16
|
Mansuri MS, Williams K, Nairn AC. Uncovering biology by single-cell proteomics. Commun Biol 2023; 6:381. [PMID: 37031277 PMCID: PMC10082756 DOI: 10.1038/s42003-023-04635-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 02/25/2023] [Indexed: 04/10/2023] Open
Abstract
Recent technological advances have opened the door to single-cell proteomics that can answer key biological questions regarding how protein expression, post-translational modifications, and protein interactions dictate cell state in health and disease.
Collapse
Affiliation(s)
- M Shahid Mansuri
- Yale/NIDA Neuroproteomics Center and Department of Molecular Biophysics and Biochemistry, Yale School of Medicine, New Haven, Connecticut, USA
| | - Kenneth Williams
- Yale/NIDA Neuroproteomics Center and Department of Molecular Biophysics and Biochemistry, Yale School of Medicine, New Haven, Connecticut, USA
| | - Angus C Nairn
- Yale/NIDA Neuroproteomics Center and Department of Psychiatry, Yale School of Medicine, New Haven, Connecticut, USA.
| |
Collapse
|
17
|
Abstract
Accurate protein quantification is key to identifying protein markers, regulatory relationships between proteins, and pathophysiological mechanisms. Realizing this potential requires sensitive and deep protein analysis of a large number of samples. Toward this goal, proteomics throughput can be increased by parallelizing the analysis of both precursors and samples using multiplexed data independent acquisition (DIA) implemented by the plexDIA framework: https://plexDIA.slavovlab.net. Here we demonstrate the improved precisions of retention time estimates within plexDIA and how this enables more accurate protein quantification. plexDIA has demonstrated multiplicative gains in throughput, and these gains may be substantially amplified by improving the multiplexing reagents, data acquisition, and interpretation. We discuss future directions for advancing plexDIA, which include engineering optimized mass-tags for high-plexDIA, introducing isotopologous carriers, and developing algorithms that utilize the regular structures of plexDIA data to improve sensitivity, proteome coverage, and quantitative accuracy. These advances in plexDIA will increase the throughput of functional proteomic assays, including quantifying protein conformations, turnover dynamics, modifications states and activities. The sensitivity of these assays will extend to single-cell analysis, thus enabling functional single-cell protein analysis.
Collapse
Affiliation(s)
- Jason Derks
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single Cell Proteomics Center, and Barnett Institute, Northeastern University, Boston, Massachusetts 02115, United States
| | - Nikolai Slavov
- Departments of Bioengineering, Biology, Chemistry and Chemical Biology, Single Cell Proteomics Center, and Barnett Institute, Northeastern University, Boston, Massachusetts 02115, United States
- Parallel Squared Technology Institute, Watertown, Massachusetts 02472, United States
| |
Collapse
|
18
|
Orsburn BC. Time-of-Flight Fragmentation Spectra Generated by the Proteomic Analysis of Single Human Cells Do Not Exhibit Atypical Fragmentation Patterns. J Proteome Res 2023; 22:1003-1008. [PMID: 36700448 PMCID: PMC10502792 DOI: 10.1021/acs.jproteome.2c00715] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Indexed: 01/27/2023]
Abstract
Recent work detailed the unique characteristics of fragmentation spectra derived from peptides from single human cells. This valuable report utilized an ultrahigh-field Orbitrap and directly compared the spectra obtained from high-concentration bulk cell HeLa lysates to those obtained from nanogram dilutions of the same and from nanowell-processed single HeLa cells. The analysis demonstrated marked differences between the fragmentation spectra generated at high and single-cell loads, most strikingly, the loss of high-mass y-series fragment ions. As significant differences exist in the physics of Orbitrap and time-of-flight mass analyzers, a comparison appeared warranted. A similar analysis was performed using isolated single pancreatic cancer cells compared to pools consisting of 100 cells. While a reanalysis of the prior Orbitrap data supports the author's original findings, the same trends are not observed in time-of-flight mass spectra of peptides from single human cells. The results are particularly striking when directly comparing the matched intensity fragment values between bulk and single-cell data generated on the same mass analyzers. Instrument acquisition files, processed data, and spectrum libraries are publicly available on MASSIVE via accession MSV000090635.
Collapse
Affiliation(s)
- Benjamin C. Orsburn
- The Department of Pharmacology
and Molecular SciencesThe Johns Hopkins
University School of Medicine, Baltimore, Maryland21205, United States
| |
Collapse
|
19
|
Advances in Mass Spectrometry-Based Single Cell Analysis. BIOLOGY 2023; 12:biology12030395. [PMID: 36979087 PMCID: PMC10045136 DOI: 10.3390/biology12030395] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 02/27/2023] [Accepted: 03/01/2023] [Indexed: 03/06/2023]
Abstract
Technological developments and improvements in single-cell isolation and analytical platforms allow for advanced molecular profiling at the single-cell level, which reveals cell-to-cell variation within the admixture cells in complex biological or clinical systems. This helps to understand the cellular heterogeneity of normal or diseased tissues and organs. However, most studies focused on the analysis of nucleic acids (e.g., DNA and RNA) and mass spectrometry (MS)-based analysis for proteins and metabolites of a single cell lagged until recently. Undoubtedly, MS-based single-cell analysis will provide a deeper insight into cellular mechanisms related to health and disease. This review summarizes recent advances in MS-based single-cell analysis methods and their applications in biology and medicine.
Collapse
|
20
|
Boekweg H, Payne SH. Challenges and opportunities for single cell computational proteomics. Mol Cell Proteomics 2023; 22:100518. [PMID: 36828128 PMCID: PMC10060113 DOI: 10.1016/j.mcpro.2023.100518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 02/15/2023] [Accepted: 02/17/2023] [Indexed: 02/25/2023] Open
Abstract
Single-cell proteomics is growing rapidly and has made several technological advancements. As most research has been focused on improving instrumentation and sample preparation methods, very little attention has been given to algorithms responsible for identifying and quantifying proteins. Given the inherent difference between bulk data and single-cell data, it's necessary to realize that current algorithms being employed on single-cell data were designed for bulk data, and have underlying assumptions that may not hold true for single-cell data. In order to develop and optimize algorithms for single-cell data, we need to characterize the differences between single-cell data and bulk data, and assess how current algorithms perform on single-cell data. Here, we present a review of algorithms responsible for identifying and quantifying peptides and proteins. We will give a review of how each type of algorithm works, assumptions it relies on, how it performs on single-cell data, and possible optimizations and solutions that could be used to address the differences in single-cell data.
Collapse
Affiliation(s)
- Hannah Boekweg
- Biology Department, Brigham Young University, Provo, Utah, USA
| | - Samuel H Payne
- Biology Department, Brigham Young University, Provo, Utah, USA.
| |
Collapse
|
21
|
Wu Y, Zhang W, Zhao Y, Wang X, Guo G. Technology development trend of electrospray ionization mass spectrometry for single-cell proteomics. Trends Analyt Chem 2022. [DOI: 10.1016/j.trac.2022.116913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
22
|
Senavirathna L, Ma C, Chen R, Pan S. Spectral Library-Based Single-Cell Proteomics Resolves Cellular Heterogeneity. Cells 2022; 11:cells11152450. [PMID: 35954294 PMCID: PMC9368228 DOI: 10.3390/cells11152450] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 07/29/2022] [Accepted: 08/04/2022] [Indexed: 02/07/2023] Open
Abstract
Dissecting the proteome of cell types and states at single-cell resolution, while being highly challenging, has significant implications in basic science and biomedicine. Mass spectrometry (MS)-based single-cell proteomics represents an emerging technology for system-wide, unbiased profiling of proteins in single cells. However, significant challenges remain in analyzing an extremely small amount of proteins collected from a single cell, as a proteome-wide amplification of proteins is not currently feasible. Here, we report an integrated spectral library-based single-cell proteomics (SLB-SCP) platform that is ultrasensitive and well suited for a large-scale analysis. To overcome the low MS/MS signal intensity intrinsically associated with a single-cell analysis, this approach takes an alternative approach by extracting a breadth of information that specifically defines the physicochemical characteristics of a peptide from MS1 spectra, including monoisotopic mass, isotopic distribution, and retention time (hydrophobicity), and uses a spectral library for proteomic identification. This conceptually unique MS platform, coupled with the DIRECT sample preparation method, enabled identification of more than 2000 proteins in a single cell to distinguish different proteome landscapes associated with cellular types and heterogeneity. We characterized individual normal and cancerous pancreatic ductal cells (HPDE and PANC-1, respectively) and demonstrated the substantial difference in the proteomes between HPDE and PANC-1 at the single-cell level. A significant upregulation of multiple protein networks in cancer hallmarks was identified in the PANC-1 cells, functionally discriminating the PANC-1 cells from the HPDE cells. This integrated platform can be built on high-resolution MS and widely accepted proteomic software, making it possible for community-wide applications.
Collapse
Affiliation(s)
- Lakmini Senavirathna
- The Brown Foundation Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Cheng Ma
- The Brown Foundation Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Ru Chen
- Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sheng Pan
- The Brown Foundation Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Department of Integrative Biology and Pharmacology, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Correspondence:
| |
Collapse
|
23
|
Beck L, Geiger T. MS-based technologies for untargeted single-cell proteomics. Curr Opin Biotechnol 2022; 76:102736. [DOI: 10.1016/j.copbio.2022.102736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 03/19/2022] [Accepted: 04/24/2022] [Indexed: 11/28/2022]
|
24
|
Single-Cell Proteomics: The Critical Role of Nanotechnology. Int J Mol Sci 2022; 23:ijms23126707. [PMID: 35743151 PMCID: PMC9224324 DOI: 10.3390/ijms23126707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 06/13/2022] [Accepted: 06/14/2022] [Indexed: 11/24/2022] Open
Abstract
In single-cell analysis, biological variability can be attributed to individual cells, their specific state, and the ability to respond to external stimuli, which are determined by protein abundance and their relative alterations. Mass spectrometry (MS)-based proteomics (e.g., SCoPE-MS and SCoPE2) can be used as a non-targeted method to detect molecules across hundreds of individual cells. To achieve high-throughput investigation, novel approaches in Single-Cell Proteomics (SCP) are needed to identify and quantify proteins as accurately as possible. Controlling sample preparation prior to LC-MS analysis is critical, as it influences sensitivity, robustness, and reproducibility. Several nanotechnological approaches have been developed for the removal of cellular debris, salts, and detergents, and to facilitate systematic sample processing at the nano- and microfluidic scale. In addition, nanotechnology has enabled high-throughput proteomics analysis, which have required the improvement of software tools, such as DART-ID or DO-MS, which are also fundamental for addressing key biological questions. Single-cell proteomics has many applications in nanomedicine and biomedical research, including advanced cancer immunotherapies or biomarker characterization, among others; and novel methods allow the quantification of more than a thousand proteins while analyzing hundreds of single cells.
Collapse
|
25
|
Ochoteco Asensio J, Verheijen M, Caiment F. Predicting missing proteomics values using machine learning: Filling the gap using transcriptomics and other biological features. Comput Struct Biotechnol J 2022; 20:2057-2069. [PMID: 35601960 PMCID: PMC9077535 DOI: 10.1016/j.csbj.2022.04.017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 04/12/2022] [Accepted: 04/12/2022] [Indexed: 12/11/2022] Open
Abstract
Proteins are often considered the main biological element in charge of the different functions and structures of a cell. However, proteomics, the global study of all expressed proteins, often performed by mass spectrometry, is limited by its stochastic sampling and can only quantify a limited amount of protein per sample. Transcriptomics, which allows an exhaustive analysis of all expressed transcripts, is often used as a surrogate. However, the transcript level does not present a high level of correlation with the corresponding protein level, notably due to the existence of several post-transcriptional regulatory mechanisms. In this publication, we hypothesize that the missing protein values in proteomics could be predicted using machine learning regression methods, trained with many features extracted from transcriptomics, including known translational regulatory elements such as microRNAs and circular RNAs. After considering different machine learning algorithms applied on two different splitting strategies, we report that random forest can predict proteins in new samples out of transcriptomics data with good accuracy. The proposed pre-processing and model building scripts can be accessed on GitHub: https://github.com/jochotecoa/ml_proteomics.
Collapse
|
26
|
Morales-Sanfrutos J, Munoz J. UNRAVELLING THE COMPLEXITY OF THE EXTRACELLULAR VESICLE LANDSCAPE WITH ADVANCED PROTEOMICS. Expert Rev Proteomics 2022; 19:89-101. [PMID: 35290757 DOI: 10.1080/14789450.2022.2052849] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
INTRODUCTION The field of extracellular vesicles (EVs) is rapidly advancing. This progress is fuelled by the potential applications of these agents as biomarkers and also as an attractive source to encapsulate therapeutics and other agents to target specific cells. AREAS COVERED Different types of EVs, including exosomes, and other nanoparticles have been identified in the last years with key regulatory functions in cell-cell communication. However, the techniques used for their purification possess inherent limitations, resulting in heterogeneous preparations contaminated by other EVs subtypes and nano-size structures. It is therefore urgent to deconvolute the molecular constituents present in each type of EVs in order to accurately ascribe their specific functions. In this context, proteomics can profile, not only the lumen proteins and surface markers, but also their post-translational modifications, which will inform on the mechanisms of cargo selection and sorting. EXPERT OPINION Mass spectrometry-based proteomics is now a mature technique and has started to deliver new insights in the EV field. Here, we review recent developments in sample preparation, mass spectrometry (MS) and computational analysis and discuss how these technological advances, in conjunction with improved purification protocols, could impact the proteomic characterization of the complex landscape of EVs and other secreted nanoparticles.
Collapse
Affiliation(s)
| | - Javier Munoz
- Proteomics Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain.,Cell Signaling and Clinical Proteomics Group. Biocruces Bizkaia Health Research Institute. 48903 Barkaldo, Spain.,Ikerbasque, Basque foundation for science, Bilbao, Spain
| |
Collapse
|
27
|
Proteomic analysis of archival breast cancer clinical specimens identifies biological subtypes with distinct survival outcomes. Nat Commun 2022; 13:896. [PMID: 35173148 PMCID: PMC8850446 DOI: 10.1038/s41467-022-28524-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 01/24/2022] [Indexed: 12/14/2022] Open
Abstract
Despite advances in genomic classification of breast cancer, current clinical tests and treatment decisions are commonly based on protein level information. Formalin-fixed paraffin-embedded (FFPE) tissue specimens with extended clinical outcomes are widely available. Here, we perform comprehensive proteomic profiling of 300 FFPE breast cancer surgical specimens, 75 of each PAM50 subtype, from patients diagnosed in 2008-2013 (n = 178) and 1986-1992 (n = 122) with linked clinical outcomes. These two cohorts are analyzed separately, and we quantify 4214 proteins across all 300 samples. Within the aggressive PAM50-classified basal-like cases, proteomic profiling reveals two groups with one having characteristic immune hot expression features and highly favorable survival. Her2-Enriched cases separate into heterogeneous groups differing by extracellular matrix, lipid metabolism, and immune-response features. Within 88 triple-negative breast cancers, four proteomic clusters display features of basal-immune hot, basal-immune cold, mesenchymal, and luminal with disparate survival outcomes. Our proteomic analysis characterizes the heterogeneity of breast cancer in a clinically-applicable manner, identifies potential biomarkers and therapeutic targets, and provides a resource for clinical breast cancer classification. Protein level information enables the identification of potential biomarkers and therapeutic targets for breast cancer. Here, the authors perform proteomic analysis of 2 cohorts of breast cancer surgical specimens and identify distinct subtypes, immune features and survival outcomes.
Collapse
|
28
|
Madej D, Wu L, Lam H. Common Decoy Distributions Simplify False Discovery Rate Estimation in Shotgun Proteomics. J Proteome Res 2022; 21:339-348. [PMID: 34989576 DOI: 10.1021/acs.jproteome.1c00600] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
In shotgun proteomics, false discovery rate (FDR) estimation is a necessary step to ensure the quality of accepted peptide-spectrum matches (PSMs) from a database search. Popular statistical validation tools for FDR control tend to rely on target-decoy searching to build empirical, dataset-specific models, which often leads to inaccurate FDR estimates. In this paper, we propose a new approach named common decoy distribution (CDD) to FDR estimation using the idea of a fixed empirical null score distribution derived from millions of peptide tandem mass spectra. To demonstrate the viability of CDD, its stability with respect to noise and the presence of unexpected peptide modifications was evaluated. PeptideProphet-based implementation of CDD was benchmarked against decoy-based PeptideProphet, and both methods exhibited similar accuracy of FDR estimates and retrieval of correct PSMs. The finding of this study calls for a re-evaluation of the necessity of dataset-specific target-decoy searches and illustrates the potential of Big Data approaches for statistical analysis in proteomics.
Collapse
Affiliation(s)
- Dominik Madej
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon 999077, Hong Kong, China
| | - Long Wu
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon 999077, Hong Kong, China
| | - Henry Lam
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon 999077, Hong Kong, China
| |
Collapse
|
29
|
Boekweg H, Van Der Watt D, Truong T, Johnston SM, Guise AJ, Plowey ED, Kelly RT, Payne SH. Features of Peptide Fragmentation Spectra in Single-Cell Proteomics. J Proteome Res 2021; 21:182-188. [PMID: 34920664 DOI: 10.1021/acs.jproteome.1c00670] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The goal of proteomics is to identify and quantify the complete set of proteins in a biological sample. Single-cell proteomics specializes in the identification and quantitation of proteins for individual cells, often used to elucidate cellular heterogeneity. The significant reduction in ions introduced into the mass spectrometer for single-cell samples could impact the features of MS2 fragmentation spectra. As all peptide identification software tools have been developed on spectra from bulk samples and the associated ion-rich spectra, the potential for spectral features to change is of great interest. We characterize the differences between single-cell spectra and bulk spectra by examining three fundamental spectral features that are likely to affect peptide identification performance. All features show significant changes in single-cell spectra, including the loss of annotated fragment ions, blurring signal and background peaks due to diminishing ion intensity, and distinct fragmentation pattern, compared to bulk spectra. As each of these features is a foundational part of peptide identification algorithms, it is critical to adjust algorithms to compensate for these losses.
Collapse
Affiliation(s)
- Hannah Boekweg
- Department of Biology, Brigham Young University, Provo Utah 84602, United States
| | - Daisha Van Der Watt
- Department of Biology, Brigham Young University, Provo Utah 84602, United States
| | - Thy Truong
- Department of Chemistry and Biochemistry, Brigham Young University, Provo Utah 84602, United States
| | - S Madisyn Johnston
- Department of Chemistry and Biochemistry, Brigham Young University, Provo Utah 84602, United States
| | - Amanda J Guise
- Translational Neuropathology, Biomarkers, Research and Development, Biogen Inc, Cambridge, Massachusetts 02142, United States
| | - Edward D Plowey
- Translational Neuropathology, Biomarkers, Research and Development, Biogen Inc, Cambridge, Massachusetts 02142, United States
| | - Ryan T Kelly
- Department of Chemistry and Biochemistry, Brigham Young University, Provo Utah 84602, United States
| | - Samuel H Payne
- Department of Biology, Brigham Young University, Provo Utah 84602, United States
| |
Collapse
|
30
|
Abstract
Current single-cell mass spectrometry (MS) methods can quantify thousands of peptides per single cell while detecting peptide-like features that may support the quantification of 10-fold more peptides. This 10-fold gain might be attained by innovations in data acquisition and interpretation even while using existing instrumentation. This perspective discusses possible directions for such innovations with the aim to stimulate community efforts for increasing the coverage and quantitative accuracy of single proteomics while simultaneously decreasing missing data. Parallel improvements in instrumentation, sample preparation, and peptide separation will afford additional gains. Together, these synergistic routes for innovation project a rapid growth in the capabilities of MS based single-cell protein analysis. These gains will directly empower applications of single-cell proteomics to biomedical research.
Collapse
Affiliation(s)
- Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States.,Barnett Institute, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|