1
|
Matthiesen R, Carvalho AS. Methods and Algorithms for Quantitative Proteomics by Mass Spectrometry. Methods Mol Biol 2020; 2051:161-197. [PMID: 31552629 DOI: 10.1007/978-1-4939-9744-2_7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Protein quantitation by mass spectrometry has always been a resourceful technique in protein discovery, and more recently it has leveraged the advent of clinical proteomics. A single mass spectrometry analysis experiment provides identification and quantitation of proteins as well as information on posttranslational modifications landscape. By contrast, protein array technologies are restricted to quantitation of targeted proteins and their modifications. Currently, there are an overwhelming number of quantitative mass spectrometry methods for protein and peptide quantitation. The aim here is to provide an overview of the most common mass spectrometry methods and algorithms used in quantitative proteomics and discuss the computational aspects to obtain reliable quantitative measures of proteins, peptides and their posttranslational modifications. The development of a pipeline using commercial or freely available software is one of the main challenges in data analysis of many experimental projects. Recent developments of R statistical programming language make it attractive to fully develop pipelines for quantitative proteomics. We discuss concepts of quantitative proteomics that together with current R packages can be used to build highly customizable pipelines.
Collapse
Affiliation(s)
- Rune Matthiesen
- Computational and Experimental Biology Group, CEDOC, Chronic Diseases Research Centre, NOVA Medical School, Faculdade de Ciências Médicas, Universidade NOVA de Lisboa, Lisboa, Portugal
| | - Ana Sofia Carvalho
- Computational and Experimental Biology Group, CEDOC, Chronic Diseases Research Centre, NOVA Medical School, Faculdade de Ciências Médicas, Universidade NOVA de Lisboa, Lisboa, Portugal.
| |
Collapse
|
2
|
Matthiesen R, Carvalho AS. Methods and algorithms for quantitative proteomics by mass spectrometry. Methods Mol Biol 2013; 1007:183-217. [PMID: 23666727 DOI: 10.1007/978-1-62703-392-3_8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Protein quantitation by mass spectrometry (MS) is attractive since it is possible to obtain both identification and quantitative values of proteins and their posttranslational modifications in a single experiment. In contrast, protein arrays only provide quantitative values of targeted proteins and their modifications. There are an overwhelming number of quantitative MS methods for protein and peptide quantitation. The aim here is to provide an overview of the most common MS methods and algorithms used in quantitative proteomics and discuss the computational algorithms needed to reliably quantitate proteins, peptides, and their posttranslational modifications. One of the main challenges in data analysis of many experimental projects is to pipe together a number of software solutions that are either commercial or freely available. The aim of this chapter is to provide a good set of algorithms, ideas, and resources that can easily be implemented in scripting language like R, Python, or Perl. By understanding the algorithmic ideas presented here, data from any instrument or modified experimental protocol can be analyzed and is therefore in the authors' opinion more valuable than a black box concept.
Collapse
Affiliation(s)
- Rune Matthiesen
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| | | |
Collapse
|
3
|
Rodríguez-Suárez E, Whetton AD. The application of quantification techniques in proteomics for biomedical research. MASS SPECTROMETRY REVIEWS 2013; 32:1-26. [PMID: 22847841 DOI: 10.1002/mas.21347] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2011] [Revised: 02/09/2012] [Accepted: 02/10/2012] [Indexed: 06/01/2023]
Abstract
The systematic analysis of biological processes requires an understanding of the quantitative expression patterns of proteins, their interacting partners and their subcellular localization. This information was formerly difficult to accrue as the relative quantification of proteins relied on antibody-based methods and other approaches with low throughput. The advent of soft ionization techniques in mass spectrometry plus advances in separation technologies has aligned protein systems biology with messenger RNA, DNA, and microarray technologies to provide data on systems as opposed to singular protein entities. Another aspect of quantitative proteomics that increases its importance for the coming few years is the significant technical developments underway both for high pressure liquid chromatography and mass spectrum devices. Hence, robustness, reproducibility and mass accuracy are still improving with every new generation of instruments. Nonetheless, the methods employed require validation and comparison to design fit for purpose experiments in advanced protein analyses. This review considers the newly developed systematic protein investigation methods and their value from the standpoint that relative or absolute protein quantification is required de rigueur in biomedical research.
Collapse
|
4
|
Gonzalez-Galarza FF, Lawless C, Hubbard SJ, Fan J, Bessant C, Hermjakob H, Jones AR. A critical appraisal of techniques, software packages, and standards for quantitative proteomic analysis. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2012; 16:431-42. [PMID: 22804616 DOI: 10.1089/omi.2012.0022] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
New methods for performing quantitative proteome analyses based on differential labeling protocols or label-free techniques are reported in the literature on an almost monthly basis. In parallel, a correspondingly vast number of software tools for the analysis of quantitative proteomics data has also been described in the literature and produced by private companies. In this article we focus on the review of some of the most popular techniques in the field and present a critical appraisal of several software packages available to process and analyze the data produced. We also describe the importance of community standards to support the wide range of software, which may assist researchers in the analysis of data using different platforms and protocols. It is intended that this review will serve bench scientists both as a useful reference and a guide to the selection and use of different pipelines to perform quantitative proteomics data analysis. We have produced a web-based tool ( http://www.proteosuite.org/?q=other_resources ) to help researchers find appropriate software for their local instrumentation, available file formats, and quantitative methodology.
Collapse
|
5
|
Bauer C, Kleinjung F, Rutishauser D, Panse C, Chadt A, Dreja T, Al-Hasani H, Reinert K, Schlapbach R, Schuchhardt J. PPINGUIN: Peptide Profiling Guided Identification of Proteins improves quantitation of iTRAQ ratios. BMC Bioinformatics 2012; 13:34. [PMID: 22340093 PMCID: PMC3368728 DOI: 10.1186/1471-2105-13-34] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2011] [Accepted: 02/16/2012] [Indexed: 01/07/2023] Open
Abstract
Background Recent development of novel technologies paved the way for quantitative proteomics. One of the most important among them is iTRAQ, employing isobaric tags for relative or absolute quantitation. Despite large progress in technology development, still many challenges remain for derivation and interpretation of quantitative results. One of these challenges is the consistent assignment of peptides to proteins. Results We have developed Peptide Profiling Guided Identification of Proteins (PPINGUIN), a statistical analysis workflow for iTRAQ data addressing the problem of ambiguous peptide quantitations. Motivated by the assumption that peptides uniquely derived from the same protein are correlated, our method employs clustering as a very early step in data processing prior to protein inference. Our method increases experimental reproducibility and decreases variability of quantitations of peptides assigned to the same protein. Giving further support to our method, application to a type 2 diabetes dataset identifies a list of protein candidates that is in very good agreement with previously performed transcriptomics meta analysis. Making use of quantitative properties of signal patterns identified, PPINGUIN can reveal new isoform candidates. Conclusions Regarding the increasing importance of quantitative proteomics we think that this method will be useful in practical applications like model fitting or functional enrichment analysis. We recommend to use this method if quantitation is a major objective of research.
Collapse
Affiliation(s)
- Chris Bauer
- MicroDiscovery GmbH, Marienburger Str, 1, 10405 Berlin, Germany.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
The identification of phosphorylation on proteins has become practicable for many laboratories in recent years, largely due to improvements in mass spectrometry (MS) and the development of methods to selectively enrich for phosphorylated peptides and proteins. However, phosphorylation is a dynamic and reversible modification which plays a central role in many biological processes including intracellular signalling. Therefore, the quantitative analysis of phosphorylated proteins and peptides is a subject of intense interest. We discuss three applications of isobaric tags for relative and absolute quantitation (iTRAQ) to the analysis of phosphopeptides from a variety of sample materials.
Collapse
|
7
|
Palmisano G, Thingholm TE. Strategies for quantitation of phosphoproteomic data. Expert Rev Proteomics 2010; 7:439-56. [PMID: 20536313 DOI: 10.1586/epr.10.19] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Recent developments in phosphoproteomic sample-preparation techniques and sensitive mass spectrometry instrumentation have led to large-scale identifications of phosphoproteins and phosphorylation sites from highly complex samples. This has facilitated the implementation of different quantitation strategies in order to study the biological role of protein phosphorylation during disease progression, differentiation or during external stimulation of a cellular system. In this article, a brief summary of the most popular strategies for phosphoproteomic studies is given; however, the main focus will be on different quantitation strategies. Methods for metabolic labeling, chemical modification and label-free quantitation and their applicability or inapplicability in phosphoproteomic studies are discussed.
Collapse
Affiliation(s)
- Giuseppe Palmisano
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M, Denmark
| | | |
Collapse
|
8
|
Rodríguez-Suárez E, Gubb E, Alzueta IF, Falcón-Pérez JM, Amorim A, Elortza F, Matthiesen R. Virtual Expert Mass Spectrometrist: iTRAQ tool for database-dependent search, quantitation and result storage. Proteomics 2010; 10:1545-56. [DOI: 10.1002/pmic.200900255] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
9
|
Matthiesen R, Carvalho AS. Methods and algorithms for relative quantitative proteomics by mass spectrometry. Methods Mol Biol 2010; 593:187-204. [PMID: 19957151 DOI: 10.1007/978-1-60327-194-3_10] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Protein quantitation by mass spectrometry (MS) is attractive since it is possible to obtain both the identification and quantitative values of novel proteins and their posttranslational modifications in one experiment. In contrast, protein arrays only provide quantitative values of targeted proteins and their modifications. There are an overwhelming number of quantitative mass spectrometry (MS) methods for protein and peptide quantitation. The aim here is to provide an overview of the most common MS-based quantitative methods used in the proteomics field and discuss the computational algorithms needed for the robust quantitation of proteins, peptides, and their posttranslational modifications.
Collapse
Affiliation(s)
- Rune Matthiesen
- Instituto de Patologia e Imunologia Molecular da Universidad do Porto - IPATIMUP, Porto, Portugal
| | | |
Collapse
|
10
|
Timms JF, Cutillas PR. Overview of quantitative LC-MS techniques for proteomics and activitomics. Methods Mol Biol 2010; 658:19-45. [PMID: 20839096 DOI: 10.1007/978-1-60761-780-8_2] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
LC-MS is a useful technique for protein and peptide quantification. In addition, as a powerful tool for systems biology research, LC-MS can also be used to quantify post-translational modifications and metabolites that reflect biochemical pathway activity. This review discusses the different analytical techniques that use LC-MS for the quantification of proteins, their modifications and activities in a multiplex manner.
Collapse
Affiliation(s)
- John F Timms
- Cancer Proteomics Laboratory, EGA Institute for Women's Health, University College London, London, UK
| | | |
Collapse
|
11
|
Schwacke JH, Hill EG, Krug EL, Comte-Walters S, Schey KL. iQuantitator: a tool for protein expression inference using iTRAQ. BMC Bioinformatics 2009; 10:342. [PMID: 19835628 PMCID: PMC2770557 DOI: 10.1186/1471-2105-10-342] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2009] [Accepted: 10/18/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Isobaric Tags for Relative and Absolute Quantitation (iTRAQ) [Applied Biosystems] have seen increased application in differential protein expression analysis. To facilitate the growing need to analyze iTRAQ data, especially for cases involving multiple iTRAQ experiments, we have developed a modeling approach, statistical methods, and tools for estimating the relative changes in protein expression under various treatments and experimental conditions. RESULTS This modeling approach provides a unified analysis of data from multiple iTRAQ experiments and links the observed quantity (reporter ion peak area) to the experiment design and the calculated quantity of interest (treatment-dependent protein and peptide fold change) through an additive model under log transformation. Others have demonstrated, through a case study, this modeling approach and noted the computational challenges of parameter inference in the unbalanced data set typical of multiple iTRAQ experiments. Here we present the development of an inference approach, based on hierarchical regression with batching of regression coefficients and Markov Chain Monte Carlo (MCMC) methods that overcomes some of these challenges. In addition to our discussion of the underlying method, we also present our implementation of the software, simulation results, experimental results, and sample output from the resulting analysis report. CONCLUSION iQuantitator's process-based modeling approach overcomes limitations in current methods and allows for application in a variety of experimental designs. Additionally, hypertext-linked documents produced by the tool aid in the interpretation and exploration of results.
Collapse
Affiliation(s)
- John H Schwacke
- Department of Biochemistry, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Elizabeth G Hill
- Department of Medicine, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Edward L Krug
- Department of Cell Biology and Anatomy, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Susana Comte-Walters
- Department of Pharmacology, Medical University of South Carolina, Charleston, South Carolina, USA
| | - Kevin L Schey
- Mass Spectrometry Research Center, Vanderbilt University, Nashville, Tennessee, USA
| |
Collapse
|
12
|
Wang Z, Gucek M, Hart GW. Cross-talk between GlcNAcylation and phosphorylation: site-specific phosphorylation dynamics in response to globally elevated O-GlcNAc. Proc Natl Acad Sci U S A 2008; 105:13793-8. [PMID: 18779572 PMCID: PMC2544533 DOI: 10.1073/pnas.0806216105] [Citation(s) in RCA: 259] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2008] [Indexed: 11/18/2022] Open
Abstract
Protein GlcNAcylation serves as a nutrient/stress sensor to modulate the functions of many nuclear and cytoplasmic proteins. O-GlcNAc cycles on serine or threonine residues like phosphorylation, is nearly as abundant, and functions, at least partially, via its interplay with phosphorylation. Here, we describe changes in site-specific phosphorylation dynamics in response to globally elevated GlcNAcylation. By combining sequential phospho-residue enrichment, iTRAQ labeling, and high throughput mass spectrometric analyses, phosphorylation dynamics on 711 phosphopeptides were quantified. Based upon their insensitivity to phosphatase inhibition, we conclude that approximately 48% of these phosphorylation sites were not actively cycling in the conditions of the study. However, increased GlcNAcylation influenced phosphate stoichiometry at most of the sites that did appear to be actively cycling. Elevated GlcNAcylation resulted in lower phosphorylation at 280 sites and caused increased phosphorylation at 148 sites. Thus, the cross-talk or interplay between these two abundant posttranslational modifications is extensive, and may arises both by steric competition for occupancy at the same or proximal sites and by each modification regulating the other's enzymatic machinery. The phosphoproteome dynamics presented by this large set of quantitative data not only delineates the complex interplay between phosphorylation and GlcNAcyation, but also provides insights for more focused investigations of specific roles of O-GlcNAc in regulating protein functions and signaling pathways.
Collapse
Affiliation(s)
- Zihao Wang
- Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD 21205-2185
| | - Marjan Gucek
- Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD 21205-2185
| | - Gerald W. Hart
- Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD 21205-2185
| |
Collapse
|