51
|
Karlsson CAQ, Järnum S, Winstedt L, Kjellman C, Björck L, Linder A, Malmström JA. Streptococcus pyogenes Infection and the Human Proteome with a Special Focus on the Immunoglobulin G-cleaving Enzyme IdeS. Mol Cell Proteomics 2018; 17:1097-1111. [PMID: 29511047 PMCID: PMC5986240 DOI: 10.1074/mcp.ra117.000525] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Revised: 02/16/2018] [Indexed: 11/16/2022] Open
Abstract
Infectious diseases are characterized by a complex interplay between host and pathogen, but how these interactions impact the host proteome is unclear. Here we applied a combined mass spectrometry-based proteomics strategy to investigate how the human proteome is transiently modified by the pathogen Streptococcus pyogenes, with a particular focus on bacterial cleavage of IgG in vivo. In invasive diseases, S. pyogenes evokes a massive host response in blood, whereas superficial diseases are characterized by a local leakage of several blood plasma proteins at the site of infection including IgG. S. pyogenes produces IdeS, a protease cleaving IgG in the lower hinge region and we find highly effective IdeS-cleavage of IgG in samples from local IgG poor microenvironments. The results show that IdeS contributes to the adaptation of S. pyogenes to its normal ecological niches. Additionally, the work identifies novel clinical opportunities for in vivo pathogen detection.
Collapse
Affiliation(s)
- Christofer A Q Karlsson
- From the ‡Lund University, Division of Infection Medicine, Department of Clinical Sciences, Solvegatan 19, BMC, Lund, 221 84 Lund, Sweden
| | - Sofia Järnum
- §Hansa Medical AB, Scheelevägen 22, 223 63 Lund, Sweden
| | - Lena Winstedt
- §Hansa Medical AB, Scheelevägen 22, 223 63 Lund, Sweden
| | | | - Lars Björck
- From the ‡Lund University, Division of Infection Medicine, Department of Clinical Sciences, Solvegatan 19, BMC, Lund, 221 84 Lund, Sweden
| | - Adam Linder
- From the ‡Lund University, Division of Infection Medicine, Department of Clinical Sciences, Solvegatan 19, BMC, Lund, 221 84 Lund, Sweden
| | - Johan A Malmström
- From the ‡Lund University, Division of Infection Medicine, Department of Clinical Sciences, Solvegatan 19, BMC, Lund, 221 84 Lund, Sweden;
| |
Collapse
|
52
|
Rolandsson Enes S, Åhrman E, Palani A, Hallgren O, Bjermer L, Malmström A, Scheding S, Malmström J, Westergren-Thorsson G. Quantitative proteomic characterization of lung-MSC and bone marrow-MSC using DIA-mass spectrometry. Sci Rep 2017; 7:9316. [PMID: 28839187 PMCID: PMC5570998 DOI: 10.1038/s41598-017-09127-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 07/19/2017] [Indexed: 01/17/2023] Open
Abstract
Mesenchymal stromal cells (MSC) are ideal candidates for cell therapies, due to their immune-regulatory and regenerative properties. We have previously reported that lung-derived MSC are tissue-resident cells with lung-specific properties compared to bone marrow-derived MSC. Assessing relevant molecular differences between lung-MSC and bone marrow-MSC is important, given that such differences may impact their behavior and potential therapeutic use. Here, we present an in-depth mass spectrometry (MS) based strategy to investigate the proteomes of lung-MSC and bone marrow-MSC. The MS-strategy relies on label free quantitative data-independent acquisition (DIA) analysis and targeted data analysis using a MSC specific spectral library. We identified several significantly differentially expressed proteins between lung-MSC and bone marrow-MSC within the cell layer (352 proteins) and in the conditioned medium (49 proteins). Bioinformatics analysis revealed differences in regulation of cell proliferation, which was functionally confirmed by decreasing proliferation rate through Cytochrome P450 stimulation. Our study reveals important differences within proteome and matrisome profiles between lung- and bone marrow-derived MSC that may influence their behavior and affect the clinical outcome when used for cell-therapy.
Collapse
Affiliation(s)
- Sara Rolandsson Enes
- Department of Experimental Medical Science, Lung Biology Unit, Lund University, 22184, Lund, Sweden.
| | - Emma Åhrman
- Department of Experimental Medical Science, Lung Biology Unit, Lund University, 22184, Lund, Sweden.,Department of Clinical Sciences Lund, Division of Infection Medicine, Lund University, 22184, Lund, Sweden
| | - Anitha Palani
- Department of Experimental Medical Science, Matrix Biology, Lund University, 22184, Lund, Sweden
| | - Oskar Hallgren
- Department of Experimental Medical Science, Lung Biology Unit, Lund University, 22184, Lund, Sweden
| | - Leif Bjermer
- Department of Respiratory Medicine and Allergology, Lund University and Skåne University Hospital, 22184, Lund, Sweden
| | - Anders Malmström
- Department of Experimental Medical Science, Matrix Biology, Lund University, 22184, Lund, Sweden
| | - Stefan Scheding
- Lund Stem Cell Center, Lund University, 22184, Lund, Sweden.,Department of Hematology, Skåne University Hospital, 22184, Lund, Sweden
| | - Johan Malmström
- Department of Clinical Sciences Lund, Division of Infection Medicine, Lund University, 22184, Lund, Sweden
| | | |
Collapse
|
53
|
Collins BC, Hunter CL, Liu Y, Schilling B, Rosenberger G, Bader SL, Chan DW, Gibson BW, Gingras AC, Held JM, Hirayama-Kurogi M, Hou G, Krisp C, Larsen B, Lin L, Liu S, Molloy MP, Moritz RL, Ohtsuki S, Schlapbach R, Selevsek N, Thomas SN, Tzeng SC, Zhang H, Aebersold R. Multi-laboratory assessment of reproducibility, qualitative and quantitative performance of SWATH-mass spectrometry. Nat Commun 2017; 8:291. [PMID: 28827567 PMCID: PMC5566333 DOI: 10.1038/s41467-017-00249-5] [Citation(s) in RCA: 400] [Impact Index Per Article: 50.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 06/12/2017] [Indexed: 01/15/2023] Open
Abstract
Quantitative proteomics employing mass spectrometry is an indispensable tool in life science research. Targeted proteomics has emerged as a powerful approach for reproducible quantification but is limited in the number of proteins quantified. SWATH-mass spectrometry consists of data-independent acquisition and a targeted data analysis strategy that aims to maintain the favorable quantitative characteristics (accuracy, sensitivity, and selectivity) of targeted proteomics at large scale. While previous SWATH-mass spectrometry studies have shown high intra-lab reproducibility, this has not been evaluated between labs. In this multi-laboratory evaluation study including 11 sites worldwide, we demonstrate that using SWATH-mass spectrometry data acquisition we can consistently detect and reproducibly quantify >4000 proteins from HEK293 cells. Using synthetic peptide dilution series, we show that the sensitivity, dynamic range and reproducibility established with SWATH-mass spectrometry are uniformly achieved. This study demonstrates that the acquisition of reproducible quantitative proteomics data by multiple labs is achievable, and broadly serves to increase confidence in SWATH-mass spectrometry data acquisition as a reproducible method for large-scale protein quantification.SWATH-mass spectrometry consists of a data-independent acquisition and a targeted data analysis strategy that aims to maintain the favorable quantitative characteristics on the scale of thousands of proteins. Here, using data generated by eleven groups worldwide, the authors show that SWATH-MS is capable of generating highly reproducible data across different laboratories.
Collapse
Affiliation(s)
- Ben C Collins
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland
| | | | - Yansheng Liu
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland
| | - Birgit Schilling
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, CA, 94945, USA
| | - George Rosenberger
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland
- PhD. Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, 8057, Switzerland
| | - Samuel L Bader
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Daniel W Chan
- Department of Pathology, Clinical Chemistry Division, Johns Hopkins University School of Medicine, Baltimore, MD, 21231, USA
| | - Bradford W Gibson
- Buck Institute for Research on Aging, 8001 Redwood Boulevard, Novato, CA, 94945, USA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, 94143, USA
| | - Anne-Claude Gingras
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, M5G 1X5, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, M5S 1A8, Ontario, Canada
| | - Jason M Held
- Departments of Medicine and Anesthesiology, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, MO, 63110, USA
| | - Mio Hirayama-Kurogi
- Department of Pharmaceutical Microbiology, Faculty of Life Sciences, Kumamoto University, 5-1 Oe-honmachi, Chuo-ku, Kumamoto, 862-0973, Japan
| | - Guixue Hou
- Proteomics Division, BGI-Shenzhen, Shenzhen, 518083, China
| | - Christoph Krisp
- Department of Chemistry and Biomolecular Sciences, Australian Proteome Analysis Facility (APAF), Macquarie University, Sydney, 2109, Australia
| | - Brett Larsen
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, M5G 1X5, Ontario, Canada
| | - Liang Lin
- Proteomics Division, BGI-Shenzhen, Shenzhen, 518083, China
| | - Siqi Liu
- Proteomics Division, BGI-Shenzhen, Shenzhen, 518083, China
| | - Mark P Molloy
- Department of Chemistry and Biomolecular Sciences, Australian Proteome Analysis Facility (APAF), Macquarie University, Sydney, 2109, Australia
| | - Robert L Moritz
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Sumio Ohtsuki
- Department of Pharmaceutical Microbiology, Faculty of Life Sciences, Kumamoto University, 5-1 Oe-honmachi, Chuo-ku, Kumamoto, 862-0973, Japan
| | - Ralph Schlapbach
- Functional Genomics Center Zurich, ETH Zurich/University of Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland
| | - Nathalie Selevsek
- Functional Genomics Center Zurich, ETH Zurich/University of Zurich, Winterthurerstr. 190, 8057, Zurich, Switzerland
| | - Stefani N Thomas
- Department of Pathology, Clinical Chemistry Division, Johns Hopkins University School of Medicine, Baltimore, MD, 21231, USA
| | - Shin-Cheng Tzeng
- Departments of Medicine and Anesthesiology, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, MO, 63110, USA
| | - Hui Zhang
- Department of Pathology, Clinical Chemistry Division, Johns Hopkins University School of Medicine, Baltimore, MD, 21231, USA
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland.
- Faculty of Science, University of Zurich, Zurich, Switzerland.
| |
Collapse
|
54
|
Rosenberger G, Bludau I, Schmitt U, Heusel M, Hunter CL, Liu Y, MacCoss MJ, MacLean BX, Nesvizhskii AI, Pedrioli PGA, Reiter L, Röst HL, Tate S, Ting YS, Collins BC, Aebersold R. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses. Nat Methods 2017; 14:921-927. [PMID: 28825704 PMCID: PMC5581544 DOI: 10.1038/nmeth.4398] [Citation(s) in RCA: 159] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 07/07/2017] [Indexed: 12/18/2022]
Abstract
Liquid chromatography coupled to tandem mass spectrometry is the main method for high-throughput identification and quantification of peptides and inferred proteins. Within this field, data-independent acquisition (DIA) combined with peptide-centric scoring, exemplified by SWATH-MS, emerged as a scalable method to achieve deep and consistent proteome coverage across large-scale datasets. Here we discuss the adaptation of statistical concepts developed for discovery proteomics based on spectrum-centric scoring to large-scale DIA experiments analyzed with peptide-centric scoring strategies and provide guidance on their application. We show that optimal tradeoffs between sensitivity and specificity require careful considerations of the relationship between proteins in the samples and proteins represented in the spectral library. We propose the application of a global analyte constraint to prevent accumulation of false positives across large-scale datasets. Furthermore, to increase the quality and reproducibility of published proteomic results, well-established confidence criteria should be reported for detected peptide queries, peptides and inferred proteins.
Collapse
Affiliation(s)
- George Rosenberger
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Isabell Bludau
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Uwe Schmitt
- ID Scientific IT Services, ETH Zurich, Zurich, Switzerland
| | - Moritz Heusel
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD program in Molecular and Translational Biomedicine, Competence Center Personalized Medicine (CC-PM), ETH Zurich and University of Zurich, Zurich, Switzerland
| | | | - Yansheng Liu
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Brendan X MacLean
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.,Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
| | - Patrick G A Pedrioli
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | - Hannes L Röst
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | - Ying S Ting
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Ben C Collins
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Faculty of Science, University of Zurich, Zurich, Switzerland
| |
Collapse
|
55
|
Enhanced differential expression statistics for data-independent acquisition proteomics. Sci Rep 2017; 7:5869. [PMID: 28724900 PMCID: PMC5517573 DOI: 10.1038/s41598-017-05949-y] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 06/07/2017] [Indexed: 01/28/2023] Open
Abstract
We describe a new reproducibility-optimization method ROPECA for statistical analysis of proteomics data with a specific focus on the emerging data-independent acquisition (DIA) mass spectrometry technology. ROPECA optimizes the reproducibility of statistical testing on peptide-level and aggregates the peptide-level changes to determine differential protein-level expression. Using a ‘gold standard’ spike-in data and a hybrid proteome benchmark data we show the competitive performance of ROPECA over conventional protein-based analysis as well as state-of-the-art peptide-based tools especially in DIA data with consistent peptide measurements. Furthermore, we also demonstrate the improved accuracy of our method in clinical studies using proteomics data from a longitudinal human twin study.
Collapse
|
56
|
Barbieri R, Guryev V, Brandsma CA, Suits F, Bischoff R, Horvatovich P. Proteogenomics: Key Driver for Clinical Discovery and Personalized Medicine. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2017; 926:21-47. [PMID: 27686804 DOI: 10.1007/978-3-319-42316-6_3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Proteogenomics is a multi-omics research field that has the aim to efficiently integrate genomics, transcriptomics and proteomics. With this approach it is possible to identify new patient-specific proteoforms that may have implications in disease development, specifically in cancer. Understanding the impact of a large number of mutations detected at the genomics level is needed to assess the effects at the proteome level. Proteogenomics data integration would help in identifying molecular changes that are persistent across multiple molecular layers and enable better interpretation of molecular mechanisms of disease, such as the causal relationship between single nucleotide polymorphisms (SNPs) and the expression of transcripts and translation of proteins compared to mainstream proteomics approaches. Identifying patient-specific protein forms and getting a better picture of molecular mechanisms of disease opens the avenue for precision and personalized medicine. Proteogenomics is, however, a challenging interdisciplinary science that requires the understanding of sample preparation, data acquisition and processing for genomics, transcriptomics and proteomics. This chapter aims to guide the reader through the technology and bioinformatics aspects of these multi-omics approaches, illustrated with proteogenomics applications having clinical or biological relevance.
Collapse
Affiliation(s)
- Ruggero Barbieri
- Department of Gastroenterology and Hepatology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Victor Guryev
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Corry-Anke Brandsma
- Department of Pathology & Medical Biology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Frank Suits
- IBM T.J. Watson Research Centre, 1101 Kitchawan Road, Yorktown Heights, New York, 10598, NY, USA
| | - Rainer Bischoff
- Department of Analytical Biochemistry, Research Institute of Pharmacy, University of Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands
| | - Peter Horvatovich
- Department of Analytical Biochemistry, Research Institute of Pharmacy, University of Groningen, Antonius Deusinglaan 1, 9713 AV, Groningen, The Netherlands.
| |
Collapse
|
57
|
Rosenberger G, Liu Y, Röst HL, Ludwig C, Buil A, Bensimon A, Soste M, Spector TD, Dermitzakis ET, Collins BC, Malmström L, Aebersold R. Inference and quantification of peptidoforms in large sample cohorts by SWATH-MS. Nat Biotechnol 2017; 35:781-788. [PMID: 28604659 PMCID: PMC5593115 DOI: 10.1038/nbt.3908] [Citation(s) in RCA: 88] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 05/22/2017] [Indexed: 01/01/2023]
Abstract
Consistent detection and quantification of protein post-translational modifications (PTMs) across sample cohorts is a prerequisite for functional analysis of biological processes. Data-independent acquisition (DIA) is a bottom-up mass spectrometry approach that provides complete information on precursor and fragment ions. However, owing to the convoluted structure of DIA data sets, confident, systematic identification and quantification of peptidoforms has remained challenging. Here, we present inference of peptidoforms (IPF), a fully automated algorithm that uses spectral libraries to query, validate and quantify peptidoforms in DIA data sets. The method was developed on data acquired by the DIA method SWATH-MS and benchmarked using a synthetic phosphopeptide reference data set and phosphopeptide-enriched samples. IPF reduced false site-localization by more than sevenfold compared with previous approaches, while recovering 85.4% of the true signals. Using IPF, we quantified peptidoforms in DIA data acquired from >200 samples of blood plasma of a human twin cohort and assessed the contribution of heritable, environmental and longitudinal effects on their PTMs.
Collapse
Affiliation(s)
- George Rosenberger
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Yansheng Liu
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Hannes L Röst
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Department of Genetics, Stanford University, Stanford, California, USA
| | - Christina Ludwig
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Bavarian Biomolecular Mass Spectrometry Center (BayBioMS), Technical University Munich, Freising, Germany
| | - Alfonso Buil
- Research Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Roskilde, Denmark
| | - Ariel Bensimon
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Martin Soste
- Department of Biology, Institute of Biochemistry, ETH Zurich, Zurich, Switzerland
| | - Tim D Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, St Thomas' Hospital Campus, London, UK
| | - Emmanouil T Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
| | - Ben C Collins
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Lars Malmström
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,S3IT, University of Zurich, Zurich, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Faculty of Science, University of Zurich, Zurich, Switzerland
| |
Collapse
|
58
|
Teleman J, Hauri S, Malmström J. Improvements in Mass Spectrometry Assay Library Generation for Targeted Proteomics. J Proteome Res 2017; 16:2384-2392. [PMID: 28516777 DOI: 10.1021/acs.jproteome.6b00928] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In data-independent acquisition mass spectrometry (DIA-MS), targeted extraction of peptide signals in silico using mass spectrometry assay libraries is a successful method for the identification and quantification of proteins. However, it remains unclear if high quality assay libraries with more accurate peptide ion coordinates can improve peptide target identification rates in DIA analysis. In this study, we systematically improved and evaluated the common algorithmic steps for assay library generation and demonstrate that increased assay quality results in substantially higher identification rates of peptide targets from mouse organ protein lysates measured by DIA-MS. The introduced changes are (1) a new spectrum interpretation algorithm, (2) reapplication of segmented retention time normalization, (3) a ppm fragment mass error matching threshold, (4) usage of internal peptide fragments, and (5) a multilevel false discovery rate calculation. Taken together, these changes yielded 14-36% more identified peptide targets at 1% assay false discovery rate and are implemented in three new open source tools, Fraggle, Tramler, and Franklin, available at https://github.com/fickludd/eviltools . The improved algorithms provide ways to better utilize discovery MS data, translating to substantially increased DIA performance and ultimately better foundations for drawing biological conclusions in DIA-based experiments.
Collapse
Affiliation(s)
- Johan Teleman
- Department of Clinical Sciences, Lund University , BMC D13, 221 84 Lund, Sweden.,Department of Immunotechnology, Lund University , Medicon Village (Building 406), 223 81 Lund, Sweden
| | - Simon Hauri
- Department of Clinical Sciences, Lund University , BMC D13, 221 84 Lund, Sweden
| | - Johan Malmström
- Department of Clinical Sciences, Lund University , BMC D13, 221 84 Lund, Sweden
| |
Collapse
|
59
|
Kreimer S, Gao Y, Ray S, Jin M, Tan Z, Mussa NA, Tao L, Li Z, Ivanov AR, Karger BL. Host Cell Protein Profiling by Targeted and Untargeted Analysis of Data Independent Acquisition Mass Spectrometry Data with Parallel Reaction Monitoring Verification. Anal Chem 2017; 89:5294-5302. [PMID: 28402653 DOI: 10.1021/acs.analchem.6b04892] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Host cell proteins (HCPs) are process-related impurities of biopharmaceuticals that remain at trace levels despite multiple stages of downstream purification. Currently, there is interest in implementing LC-MS in biopharmaceutical HCP profiling alongside conventional ELISA, because individual species can be identified and quantitated. Conventional data dependent LC-MS is hampered by the low concentration of HCP-derived peptides, which are 5-6 orders of magnitude less abundant than the biopharmaceutical-derived peptides. In this paper, we present a novel data independent acquisition (DIA)-MS workflow to identify HCP peptides using automatically combined targeted and untargeted data processing, followed by verification and quantitation using parallel reaction monitoring (PRM). Untargeted data processing with DIA-Umpire provided a means of identifying HCPs not represented in the assay library used for targeted, peptide-centric, data analysis. An IgG1 monoclonal antibody (mAb) purified by Protein A column elution, cation exchange chromatography, and ultrafiltration was analyzed using the workflow with 1D-LC. Five protein standards added at 0.5 to 100 ppm concentrations were detected in the background of the purified mAb, demonstrating sensitivity to low ppm levels. A calibration curve was constructed on the basis of the summed peak areas of the three highest intensity fragment ions from the highest intensity peptide of each protein standard. Sixteen HCPs were identified and quantitated on the basis of the calibration curve over the range of low ppm to over 100 ppm in the purified mAb sample. The developed approach achieves rapid HCP profiling using 1D-LC and specific identification exploiting the high mass accuracy and resolution of the mass spectrometer.
Collapse
Affiliation(s)
- Simion Kreimer
- Barnett Institute and Department of Chemistry and Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| | - Yuanwei Gao
- Barnett Institute and Department of Chemistry and Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| | - Somak Ray
- Barnett Institute and Department of Chemistry and Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| | - Mi Jin
- Bristol-Myers Squibb , Biologics Process and Product Development, 38 Jackson Road, Devens, Massachusetts 01434, United States
| | - Zhijun Tan
- Bristol-Myers Squibb , Biologics Process and Product Development, 38 Jackson Road, Devens, Massachusetts 01434, United States
| | - Nesredin A Mussa
- Bristol-Myers Squibb , Biologics Process and Product Development, 38 Jackson Road, Devens, Massachusetts 01434, United States
| | - Li Tao
- Bristol-Myers Squibb , Biologics Process and Product Development, 38 Jackson Road, Devens, Massachusetts 01434, United States
| | - Zhengjian Li
- Bristol-Myers Squibb , Biologics Process and Product Development, 38 Jackson Road, Devens, Massachusetts 01434, United States
| | - Alexander R Ivanov
- Barnett Institute and Department of Chemistry and Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| | - Barry L Karger
- Barnett Institute and Department of Chemistry and Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| |
Collapse
|
60
|
Li S, Cao Q, Xiao W, Guo Y, Yang Y, Duan X, Shui W. Optimization of Acquisition and Data-Processing Parameters for Improved Proteomic Quantification by Sequential Window Acquisition of All Theoretical Fragment Ion Mass Spectrometry. J Proteome Res 2017; 16:738-747. [PMID: 27995803 DOI: 10.1021/acs.jproteome.6b00767] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Proteomic analysis with data-independent acquisition (DIA) approaches represented by the sequential window acquisition of all theoretical fragment ion spectra (SWATH) technique has gained intense interest in recent years because DIA is able to overcome the intrinsic weakness of conventional data-dependent acquisition (DDA) methods and afford higher throughout and reproducibility for proteome-wide quantification. Although the raw mass spectrometry (MS) data quality and the data-mining workflow conceivably influence the throughput, accuracy and consistency of SWATH-based proteomic quantification, there lacks a systematic evaluation and optimization of the acquisition and data-processing parameters for SWATH MS analysis. Herein, we evaluated the impact of major acquisition parameters such as the precursor mass range, isolation window width and accumulation time as well as the data-processing variables including peak extraction criteria and spectra library selection on SWATH performance. Fine tuning these interdependent parameters can further improve the throughput and accuracy of SWATH quantification compared to the original setting adopted in most SWATH proteomic studies. Furthermore, we compared the effectiveness of two widely used peak extraction software PeakView and Spectronaut in discovery of differentially expressed proteins in a biological context. Our work is believed to contribute to a deeper understanding of the critical factors in SWATH MS experiments and help researchers optimize their SWATH parameters and workflows depending on the sample type, available instrument and software.
Collapse
Affiliation(s)
- Shanshan Li
- iHuman Institute, ShanghaiTech University , Shanghai 201210, China
| | - Qichen Cao
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences , Tianjin 300308, China
| | - Weidi Xiao
- College of Life Sciences, Nankai University , Tianjin 300071, China
| | - Yufeng Guo
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences , Tianjin 300308, China
| | - Yunfei Yang
- College of Life Sciences, Nankai University , Tianjin 300071, China
| | - Xiaoxiao Duan
- College of Life Sciences, Nankai University , Tianjin 300071, China
| | - Wenqing Shui
- iHuman Institute, ShanghaiTech University , Shanghai 201210, China
| |
Collapse
|
61
|
Röst HL, Aebersold R, Schubert OT. Automated SWATH Data Analysis Using Targeted Extraction of Ion Chromatograms. Methods Mol Biol 2017; 1550:289-307. [PMID: 28188537 DOI: 10.1007/978-1-4939-6747-6_20] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Targeted mass spectrometry comprises a set of methods able to quantify protein analytes in complex mixtures with high accuracy and sensitivity. These methods, e.g., Selected Reaction Monitoring (SRM) and SWATH MS, use specific mass spectrometric coordinates (assays) for reproducible detection and quantification of proteins. In this protocol, we describe how to analyze, in a targeted manner, data from a SWATH MS experiment aimed at monitoring thousands of proteins reproducibly over many samples. We present a standard SWATH MS analysis workflow, including manual data analysis for quality control (based on Skyline) as well as automated data analysis with appropriate control of error rates (based on the OpenSWATH workflow). We also discuss considerations to ensure maximal coverage, reproducibility, and quantitative accuracy.
Collapse
Affiliation(s)
- Hannes L Röst
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland.
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA.
| | - Ruedi Aebersold
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland.
- Faculty of Science, University of Zurich, CH-8057, Zurich, Switzerland.
| | - Olga T Schubert
- Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Zurich, Switzerland.
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA, 90095, USA.
| |
Collapse
|
62
|
Röst HL, Malmström L, Aebersold R. Reproducible quantitative proteotype data matrices for systems biology. Mol Biol Cell 2016; 26:3926-31. [PMID: 26543201 PMCID: PMC4710225 DOI: 10.1091/mbc.e15-07-0507] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Historically, many mass spectrometry–based proteomic studies have aimed at compiling an inventory of protein compounds present in a biological sample, with the long-term objective of creating a proteome map of a species. However, to answer fundamental questions about the behavior of biological systems at the protein level, accurate and unbiased quantitative data are required in addition to a list of all protein components. Fueled by advances in mass spectrometry, the proteomics field has thus recently shifted focus toward the reproducible quantification of proteins across a large number of biological samples. This provides the foundation to move away from pure enumeration of identified proteins toward quantitative matrices of many proteins measured across multiple samples. It is argued here that data matrices consisting of highly reproducible, quantitative, and unbiased proteomic measurements across a high number of conditions, referred to here as quantitative proteotype maps, will become the fundamental currency in the field and provide the starting point for downstream biological analysis. Such proteotype data matrices, for example, are generated by the measurement of large patient cohorts, time series, or multiple experimental perturbations. They are expected to have a large effect on systems biology and personalized medicine approaches that investigate the dynamic behavior of biological systems across multiple perturbations, time points, and individuals.
Collapse
Affiliation(s)
- Hannes L Röst
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093 Zurich, Switzerland Department of Genetics, Stanford University, Stanford, CA 94305
| | - Lars Malmström
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093 Zurich, Switzerland S3IT, University of Zurich, CH-8057 Zurich, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093 Zurich, Switzerland Faculty of Science, University of Zurich, CH-8057 Zurich, Switzerland
| |
Collapse
|
63
|
Röst HL, Liu Y, D'Agostino G, Zanella M, Navarro P, Rosenberger G, Collins BC, Gillet L, Testa G, Malmström L, Aebersold R. TRIC: an automated alignment strategy for reproducible protein quantification in targeted proteomics. Nat Methods 2016; 13:777-83. [PMID: 27479329 PMCID: PMC5008461 DOI: 10.1038/nmeth.3954] [Citation(s) in RCA: 137] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2015] [Accepted: 06/14/2016] [Indexed: 12/16/2022]
Abstract
Large scale, quantitative proteomic studies have become essential for the analysis of clinical cohorts, large perturbation experiments and systems biology studies. While next-generation mass spectrometric techniques such as SWATH-MS have substantially increased throughput and reproducibility, ensuring consistent quantification of thousands of peptide analytes across multiple LC-MS/MS runs remains a challenging and laborious manual process. To produce highly consistent and quantitatively accurate proteomics data matrices in an automated fashion, we have developed the TRIC software which utilizes fragment ion data to perform cross-run alignment, consistent peak-picking and quantification for high throughput targeted proteomics. TRIC uses a graph-based alignment strategy based on non-linear retention time correction to integrate peak elution information from all LC-MS/MS runs acquired in a study. When compared to state-of-the-art SWATH-MS data analysis, the algorithm was able to reduce the identification error by more than 3-fold at constant recall, while correcting for highly non-linear chromatographic effects. On a pulsed-SILAC experiment performed on human induced pluripotent stem (iPS) cells, TRIC was able to automatically align and quantify thousands of light and heavy isotopic peak groups and substantially increased the quantitative completeness and biological information in the data, providing insights into protein dynamics of iPS cells. Overall, this study demonstrates the importance of consistent quantification in highly challenging experimental setups, and proposes an algorithm to automate this task, constituting the last missing piece in a pipeline for automated analysis of massively parallel targeted proteomics datasets.
Collapse
Affiliation(s)
- Hannes L Röst
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Department of Genetics, Stanford University, Stanford, California, USA
| | - Yansheng Liu
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Giuseppe D'Agostino
- Department of Experimental Oncology, European Institute of Oncology, Milan, Italy
| | - Matteo Zanella
- Department of Experimental Oncology, European Institute of Oncology, Milan, Italy
| | - Pedro Navarro
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Institute for Immunology, University Medical Center of the Johannes Gutenberg University of Mainz, Mainz, Germany
| | - George Rosenberger
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Ben C Collins
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Ludovic Gillet
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Giuseppe Testa
- Department of Experimental Oncology, European Institute of Oncology, Milan, Italy.,Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy
| | - Lars Malmström
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,S3IT, University of Zurich, Zurich, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Faculty of Science, University of Zurich, Zurich, Switzerland
| |
Collapse
|
64
|
Shi T, Song E, Nie S, Rodland KD, Liu T, Qian WJ, Smith RD. Advances in targeted proteomics and applications to biomedical research. Proteomics 2016; 16:2160-82. [PMID: 27302376 PMCID: PMC5051956 DOI: 10.1002/pmic.201500449] [Citation(s) in RCA: 167] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2015] [Revised: 05/09/2016] [Accepted: 06/10/2016] [Indexed: 12/17/2022]
Abstract
Targeted proteomics technique has emerged as a powerful protein quantification tool in systems biology, biomedical research, and increasing for clinical applications. The most widely used targeted proteomics approach, selected reaction monitoring (SRM), also known as multiple reaction monitoring (MRM), can be used for quantification of cellular signaling networks and preclinical verification of candidate protein biomarkers. As an extension to our previous review on advances in SRM sensitivity (Shi et al., Proteomics, 12, 1074-1092, 2012) herein we review recent advances in the method and technology for further enhancing SRM sensitivity (from 2012 to present), and highlighting its broad biomedical applications in human bodily fluids, tissue and cell lines. Furthermore, we also review two recently introduced targeted proteomics approaches, parallel reaction monitoring (PRM) and data-independent acquisition (DIA) with targeted data extraction on fast scanning high-resolution accurate-mass (HR/AM) instruments. Such HR/AM targeted quantification with monitoring all target product ions addresses SRM limitations effectively in specificity and multiplexing; whereas when compared to SRM, PRM and DIA are still in the infancy with a limited number of applications. Thus, for HR/AM targeted quantification we focus our discussion on method development, data processing and analysis, and its advantages and limitations in targeted proteomics. Finally, general perspectives on the potential of achieving both high sensitivity and high sample throughput for large-scale quantification of hundreds of target proteins are discussed.
Collapse
Affiliation(s)
- Tujin Shi
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Ehwang Song
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Song Nie
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Karin D Rodland
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Tao Liu
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Wei-Jun Qian
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Richard D Smith
- Biological Sciences Division and Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| |
Collapse
|
65
|
Müller DB, Schubert OT, Röst H, Aebersold R, Vorholt JA. Systems-level Proteomics of Two Ubiquitous Leaf Commensals Reveals Complementary Adaptive Traits for Phyllosphere Colonization. Mol Cell Proteomics 2016; 15:3256-3269. [PMID: 27457762 DOI: 10.1074/mcp.m116.058164] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Indexed: 12/24/2022] Open
Abstract
Plants are colonized by a diverse community of microorganisms, the plant microbiota, exhibiting a defined and conserved taxonomic structure. Niche separation based on spatial segregation and complementary adaptation strategies likely forms the basis for coexistence of the various microorganisms in the plant environment. To gain insights into organism-specific adaptations on a molecular level, we selected two exemplary community members of the core leaf microbiota and profiled their proteomes upon Arabidopsis phyllosphere colonization. The highly quantitative mass spectrometric technique SWATH MS was used and allowed for the analysis of over two thousand proteins spanning more than three orders of magnitude in abundance for each of the model strains. The data suggest that Sphingomonas melonis utilizes amino acids and hydrocarbon compounds during colonization of leaves whereas Methylobacterium extorquens relies on methanol metabolism in addition to oxalate metabolism, aerobic anoxygenic photosynthesis and alkanesulfonate utilization. Comparative genomic analyses indicates that utilization of oxalate and alkanesulfonates is widespread among leaf microbiota members whereas, aerobic anoxygenic photosynthesis is almost exclusively found in Methylobacteria. Despite the apparent niche separation between these two strains we also found a relatively small subset of proteins to be coregulated, indicating common mechanisms, underlying successful leaf colonization. Overall, our results reveal for two ubiquitous phyllosphere commensals species-specific adaptations to the host environment and provide evidence for niche separation within the plant microbiota.
Collapse
Affiliation(s)
- Daniel B Müller
- From the ‡Department of Biology, Institute of Microbiology, ETH Zurich, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland
| | - Olga T Schubert
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Auguste-Piccard-Hof 1, 8093 Zurich, Switzerland
| | - Hannes Röst
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Auguste-Piccard-Hof 1, 8093 Zurich, Switzerland
| | - Ruedi Aebersold
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Auguste-Piccard-Hof 1, 8093 Zurich, Switzerland; ¶Faculty of Science, University of Zurich, Zurich, Switzerland
| | - Julia A Vorholt
- From the ‡Department of Biology, Institute of Microbiology, ETH Zurich, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland;
| |
Collapse
|
66
|
Teleman J, Chawade A, Sandin M, Levander F, Malmström J. Dinosaur: A Refined Open-Source Peptide MS Feature Detector. J Proteome Res 2016; 15:2143-51. [PMID: 27224449 PMCID: PMC4933939 DOI: 10.1021/acs.jproteome.6b00016] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
![]()
In bottom-up mass spectrometry (MS)-based
proteomics, peptide isotopic
and chromatographic traces (features) are frequently used for label-free
quantification in data-dependent acquisition MS but can also be used
for the improved identification of chimeric spectra or sample complexity
characterization. Feature detection is difficult because of the high
complexity of MS proteomics data from biological samples, which frequently
causes features to intermingle. In addition, existing feature detection
algorithms commonly suffer from compatibility issues, long computation
times, or poor performance on high-resolution data. Because of these
limitations, we developed a new tool, Dinosaur, with increased speed
and versatility. Dinosaur has the functionality to sample algorithm
computations through quality-control plots, which we call a plot trail.
From the evaluation of this plot trail, we introduce several algorithmic
improvements to further improve the robustness and performance of
Dinosaur, with the detection of features for 98% of MS/MS identifications
in a benchmark data set, and no other algorithm tested in this study
passed 96% feature detection. We finally used Dinosaur to reimplement
a published workflow for peptide identification in chimeric spectra,
increasing chimeric identification from 26% to 32% over the standard
workflow. Dinosaur is operating-system-independent and is freely available
as open source on https://github.com/fickludd/dinosaur.
Collapse
Affiliation(s)
- Johan Teleman
- Department of Immunotechnology, Lund University , 223 83 Lund, Sweden.,Department of Clinical Sciences Lund, Lund University , 221 00 Lund, Sweden
| | - Aakash Chawade
- Department of Immunotechnology, Lund University , 223 83 Lund, Sweden
| | - Marianne Sandin
- Department of Immunotechnology, Lund University , 223 83 Lund, Sweden
| | - Fredrik Levander
- Department of Immunotechnology, Lund University , 223 83 Lund, Sweden.,Bioinformatics Infrastructure for Life Sciences (BILS), Lund University , 223 83 Lund, Sweden
| | - Johan Malmström
- Department of Clinical Sciences Lund, Lund University , 221 00 Lund, Sweden
| |
Collapse
|
67
|
Blattmann P, Heusel M, Aebersold R. SWATH2stats: An R/Bioconductor Package to Process and Convert Quantitative SWATH-MS Proteomics Data for Downstream Analysis Tools. PLoS One 2016; 11:e0153160. [PMID: 27054327 PMCID: PMC4824525 DOI: 10.1371/journal.pone.0153160] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2016] [Accepted: 03/24/2016] [Indexed: 11/19/2022] Open
Abstract
SWATH-MS is an acquisition and analysis technique of targeted proteomics that enables measuring several thousand proteins with high reproducibility and accuracy across many samples. OpenSWATH is popular open-source software for peptide identification and quantification from SWATH-MS data. For downstream statistical and quantitative analysis there exist different tools such as MSstats, mapDIA and aLFQ. However, the transfer of data from OpenSWATH to the downstream statistical tools is currently technically challenging. Here we introduce the R/Bioconductor package SWATH2stats, which allows convenient processing of the data into a format directly readable by the downstream analysis tools. In addition, SWATH2stats allows annotation, analyzing the variation and the reproducibility of the measurements, FDR estimation, and advanced filtering before submitting the processed data to downstream tools. These functionalities are important to quickly analyze the quality of the SWATH-MS data. Hence, SWATH2stats is a new open-source tool that summarizes several practical functionalities for analyzing, processing, and converting SWATH-MS data and thus facilitates the efficient analysis of large-scale SWATH/DIA datasets.
Collapse
Affiliation(s)
- Peter Blattmann
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland
- * E-mail:
| | - Moritz Heusel
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland
- PhD program in Molecular and Translational Biomedicine, Competence Center Personalized Medicine UZH/ETH & Life Science Zurich Graduate School, ETH Zurich and University of Zurich, 8044, Zurich, Switzerland
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8093, Zurich, Switzerland
- Faculty of Science, University of Zurich, 8057, Zurich, Switzerland
| |
Collapse
|
68
|
Large-scale inference of protein tissue origin in gram-positive sepsis plasma using quantitative targeted proteomics. Nat Commun 2016; 7:10261. [PMID: 26732734 PMCID: PMC4729823 DOI: 10.1038/ncomms10261] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2015] [Accepted: 11/23/2015] [Indexed: 01/30/2023] Open
Abstract
The plasma proteome is highly dynamic and variable, composed of proteins derived from surrounding tissues and cells. To investigate the complex processes that control the composition of the plasma proteome, we developed a mass spectrometry-based proteomics strategy to infer the origin of proteins detected in murine plasma. The strategy relies on the construction of a comprehensive protein tissue atlas from cells and highly vascularized organs using shotgun mass spectrometry. The protein tissue atlas was transformed to a spectral library for highly reproducible quantification of tissue-specific proteins directly in plasma using SWATH-like data-independent mass spectrometry analysis. We show that the method can determine drastic changes of tissue-specific protein profiles in blood plasma from mouse animal models with sepsis. The strategy can be extended to several other species advancing our understanding of the complex processes that contribute to the plasma proteome dynamics. Sepsis can lead to multiple organ failure that could potentially be reflected by change in plasma protein abundance. Here the authors describe a proteomics strategy that allows the determination of plasma proteins tissue origin in a quantitative manner for use as biomarkers—illustrated in a mouse model of sepsis.
Collapse
|
69
|
Bilbao A, Zhang Y, Varesio E, Luban J, Strambio-De-Castillia C, Lisacek F, Hopfgartner G. Ranking Fragment Ions Based on Outlier Detection for Improved Label-Free Quantification in Data-Independent Acquisition LC-MS/MS. J Proteome Res 2015; 14:4581-93. [PMID: 26412574 DOI: 10.1021/acs.jproteome.5b00394] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Data-independent acquisition LC-MS/MS techniques complement supervised methods for peptide quantification. However, due to the wide precursor isolation windows, these techniques are prone to interference at the fragment ion level, which, in turn, is detrimental for accurate quantification. The nonoutlier fragment ion (NOFI) ranking algorithm has been developed to assign low priority to fragment ions affected by interference. By using the optimal subset of high-priority fragment ions, these interfered fragment ions are effectively excluded from quantification. NOFI represents each fragment ion as a vector of four dimensions related to chromatographic and MS fragmentation attributes and applies multivariate outlier detection techniques. Benchmarking conducted on a well-defined quantitative data set (i.e., the SWATH Gold Standard) indicates that NOFI on average is able to accurately quantify 11-25% more peptides than the commonly used Top-N library intensity ranking method. The sum of the area of the Top3-5 NOFIs produces similar coefficients of variation as compared to that with the library intensity method but with more accurate quantification results. On a biologically relevant human dendritic cell digest data set, NOFI properly assigns low-priority ranks to 85% of annotated interferences, resulting in sensitivity values between 0.92 and 0.80, against 0.76 for the Spectronaut interference detection algorithm.
Collapse
Affiliation(s)
- Aivett Bilbao
- Life Sciences Mass Spectrometry, School of Pharmaceutical Sciences, University of Geneva, University of Lausanne , CH-1211 Geneva 4, Switzerland.,Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , CH-1211 Geneva 4, Switzerland
| | - Ying Zhang
- Life Sciences Mass Spectrometry, School of Pharmaceutical Sciences, University of Geneva, University of Lausanne , CH-1211 Geneva 4, Switzerland
| | - Emmanuel Varesio
- Life Sciences Mass Spectrometry, School of Pharmaceutical Sciences, University of Geneva, University of Lausanne , CH-1211 Geneva 4, Switzerland
| | - Jeremy Luban
- Program in Molecular Medicine, University of Massachusetts Medical School , Worcester, Massachusetts 01605, United States
| | - Caterina Strambio-De-Castillia
- Program in Molecular Medicine, University of Massachusetts Medical School , Worcester, Massachusetts 01605, United States
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics , CH-1211 Geneva 4, Switzerland.,Faculty of Sciences, University of Geneva , CH-1211 Geneva 4, Switzerland
| | - Gérard Hopfgartner
- Life Sciences Mass Spectrometry, School of Pharmaceutical Sciences, University of Geneva, University of Lausanne , CH-1211 Geneva 4, Switzerland
| |
Collapse
|
70
|
Malmström L, Bakochi A, Svensson G, Kilsgård O, Lantz H, Petersson AC, Hauri S, Karlsson C, Malmström J. Quantitative proteogenomics of human pathogens using DIA-MS. J Proteomics 2015; 129:98-107. [PMID: 26381203 DOI: 10.1016/j.jprot.2015.09.012] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Revised: 09/04/2015] [Accepted: 09/09/2015] [Indexed: 11/26/2022]
Abstract
The increasing number of bacterial genomes in combination with reproducible quantitative proteome measurements provides new opportunities to explore how genetic differences modulate proteome composition and virulence. It is challenging to combine genome and proteome data as the underlying genome influences the proteome. We present a strategy to facilitate the integration of genome data from several genetically similar bacterial strains with data-independent analysis mass spectrometry (DIA-MS) for rapid interrogation of the combined data sets. The strategy relies on the construction of a composite genome combining all genetic data in a compact format, which can accommodate the fusion with quantitative peptide and protein information determined via DIA-MS. We demonstrate the method by combining data sets from whole genome sequencing, shotgun MS and DIA-MS from 34 clinical isolates of Streptococcus pyogenes. The data structure allows for fast exploration of the data showing that undetected proteins are on average more amenable to amino acid substitution than expressed proteins. We identified several significantly differentially expressed proteins between invasive and non-invasive strains. The work underlines how integration of whole genome sequencing with accurately quantified proteomes can further advance the interpretation of the relationship between genomes, proteomes and virulence. This article is part of a Special Issue entitled: Computational Proteomics.
Collapse
Affiliation(s)
| | - Anahita Bakochi
- Division of Infection Medicine, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Gabriel Svensson
- Division of Infection Medicine, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Ola Kilsgård
- Division of Infection Medicine, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Henrik Lantz
- Department of Medical Biochemistry and Microbiology/BILS, Uppsala University, Uppsala, Sweden
| | - Ann Cathrine Petersson
- Department of Clinical Microbiology, Division of Laboratory Medicine, Region Skåne, Lund, Sweden
| | - Simon Hauri
- Division of Infection Medicine, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Christofer Karlsson
- Division of Infection Medicine, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| | - Johan Malmström
- Division of Infection Medicine, Department of Clinical Sciences Lund, Lund University, Lund, Sweden
| |
Collapse
|
71
|
Parker SJ, Rost H, Rosenberger G, Collins BC, Malmström L, Amodei D, Venkatraman V, Raedschelders K, Van Eyk JE, Aebersold R. Identification of a Set of Conserved Eukaryotic Internal Retention Time Standards for Data-independent Acquisition Mass Spectrometry. Mol Cell Proteomics 2015. [PMID: 26199342 DOI: 10.1074/mcp.o114.042267] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Accurate knowledge of retention time (RT) in liquid chromatography-based mass spectrometry data facilitates peptide identification, quantification, and multiplexing in targeted and discovery-based workflows. Retention time prediction is particularly important for peptide analysis in emerging data-independent acquisition (DIA) experiments such as SWATH-MS. The indexed RT approach, iRT, uses synthetic spiked-in peptide standards (SiRT) to set RT to a unit-less scale, allowing for normalization of peptide RT between different samples and chromatographic set-ups. The obligatory use of SiRTs can be costly and complicates comparisons and data integration if standards are not included in every sample. Reliance on SiRTs also prevents the inclusion of archived mass spectrometry data for generation of the peptide assay libraries central to targeted DIA-MS data analysis. We have identified a set of peptide sequences that are conserved across most eukaryotic species, termed Common internal Retention Time standards (CiRT). In a series of tests to support the appropriateness of the CiRT-based method, we show: (1) the CiRT peptides normalized RT in human, yeast, and mouse cell lysate derived peptide assay libraries and enabled merging of archived libraries for expanded DIA-MS quantitative applications; (2) CiRTs predicted RT in SWATH-MS data within a 2-min margin of error for the majority of peptides; and (3) normalization of RT using the CiRT peptides enabled the accurate SWATH-MS-based quantification of 340 synthetic isotopically labeled peptides that were spiked into either human or yeast cell lysate. To automate and facilitate the use of these CiRT peptide lists or other custom user-defined internal RT reference peptides in DIA workflows, an algorithm was designed to automatically select a high-quality subset of datapoints for robust linear alignment of RT for use. Implementations of this algorithm are available for the OpenSWATH and Skyline platforms. Thus, CiRT peptides can be used alone or as a complement to SiRTs for RT normalization across peptide spectral libraries and in quantitative DIA-MS studies.
Collapse
Affiliation(s)
- Sarah J Parker
- ‡‡Advanced Clinical Biosystems Research Institute, The Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California
| | - Hannes Rost
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; ¶PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - George Rosenberger
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; ¶PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
| | - Ben C Collins
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | | | - Vidya Venkatraman
- ‡‡Advanced Clinical Biosystems Research Institute, The Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California
| | - Koen Raedschelders
- ‡‡Advanced Clinical Biosystems Research Institute, The Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California
| | - Jennifer E Van Eyk
- From the ‡Department of Medicine, Johns Hopkins University, Baltimore Maryland; ‡‡Advanced Clinical Biosystems Research Institute, The Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, California
| | - Ruedi Aebersold
- §Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; §§Faculty of Science, University of Zurich, Zurich, Switzerland
| |
Collapse
|
72
|
Röst HL, Rosenberger G, Aebersold R, Malmström L. Efficient visualization of high-throughput targeted proteomics experiments: TAPIR: Fig. 1. Bioinformatics 2015; 31:2415-7. [DOI: 10.1093/bioinformatics/btv152] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 03/11/2015] [Indexed: 11/14/2022] Open
|