1
|
Madej D, Lam H. PyViscount: Validating False Discovery Rate Estimation Methods via Random Search Space Partition. J Proteome Res 2025; 24:1118-1134. [PMID: 39905949 PMCID: PMC11894659 DOI: 10.1021/acs.jproteome.4c00743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Revised: 01/20/2025] [Accepted: 01/28/2025] [Indexed: 02/06/2025]
Abstract
Validating false discovery rate (FDR) estimation is an essential but surprisingly understudied aspect of method development in shotgun proteomics. Currently available validation protocols mostly rely on ground truth data sets, which typically involve manipulating the properties of the search space or query spectra used. As a result, comparing estimated FDR and ground truth-based false discovery proportion values may not be representative of the scenarios involving natural data sets encountered in practice. In this study, we introduce PyViscount─a Python tool implementing a novel validation protocol based on random search space partition, which enables generating a quasi ground-truth using unaltered search spaces of unique candidate peptides and generic data sets of experimental query spectra. Furthermore, validation of existing FDR estimation methods by PyViscount is consistent with alternative validation protocols. The presented novel approach to validation free from the need for synthetic data sets or dubious manipulation of the data may be an attractive alternative for proteomics practitioners, allowing them to obtain deeper insights into the performance of existing and new FDR estimation methods.
Collapse
Affiliation(s)
- Dominik Madej
- Department of Chemical and
Biological Engineering, The Hong Kong University
of Science and Technology, Hong Kong 999077, China
| | - Henry Lam
- Department of Chemical and
Biological Engineering, The Hong Kong University
of Science and Technology, Hong Kong 999077, China
| |
Collapse
|
2
|
Mansuri MS, Bathla S, Lam TT, Nairn AC, Williams KR. Optimal conditions for carrying out trypsin digestions on complex proteomes: From bulk samples to single cells. J Proteomics 2024; 297:105109. [PMID: 38325732 PMCID: PMC10939724 DOI: 10.1016/j.jprot.2024.105109] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/10/2024] [Accepted: 01/31/2024] [Indexed: 02/09/2024]
Abstract
To identify proteins by the bottom-up mass spectrometry workflow, enzymatic digestion is essential to break down proteins into smaller peptides amenable to both chromatographic separation and mass spectrometric analysis. Trypsin is the most extensively used protease due to its high cleavage specificity and generation of peptides with desirable positively charged N- and C-terminal amino acid residues that are amenable to reverse phase HPLC separation and MS/MS analyses. However, trypsin can yield variable digestion profiles and its protein cleavage activity is interdependent on trypsin source and quality, digestion time and temperature, pH, denaturant, trypsin and substrate concentrations, composition/complexity of the sample matrix, and other factors. There is therefore a need for a more standardized, general-purpose trypsin digestion protocol. Based on a review of the literature we delineate optimal conditions for carrying out trypsin digestions of complex proteomes from bulk samples to limiting amounts of protein extracts. Furthermore, we highlight recent developments and technological advances used in digestion protocols to quantify complex proteomes from single cells. SIGNIFICANCE: Currently, bottom-up MS-based proteomics is the method of choice for global proteome analysis. Since trypsin is the most utilized protease in bottom-up MS proteomics, delineating optimal conditions for carrying out trypsin digestions of complex proteomes in samples ranging from tissues to single cells should positively impact a broad range of biomedical research.
Collapse
Affiliation(s)
- M Shahid Mansuri
- Yale/NIDA Neuroproteomics Center, New Haven, CT 06511, USA; Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, CT 06511, USA.
| | - Shveta Bathla
- Yale/NIDA Neuroproteomics Center, New Haven, CT 06511, USA; Department of Psychiatry, Yale School of Medicine, New Haven, CT 06511, USA
| | - TuKiet T Lam
- Yale/NIDA Neuroproteomics Center, New Haven, CT 06511, USA; Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, CT 06511, USA; Keck MS & Proteomics Resource, Yale School of Medicine, New Haven, CT 06511, USA
| | - Angus C Nairn
- Yale/NIDA Neuroproteomics Center, New Haven, CT 06511, USA; Department of Psychiatry, Yale School of Medicine, New Haven, CT 06511, USA
| | - Kenneth R Williams
- Yale/NIDA Neuroproteomics Center, New Haven, CT 06511, USA; Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, CT 06511, USA; Keck MS & Proteomics Resource, Yale School of Medicine, New Haven, CT 06511, USA.
| |
Collapse
|
3
|
Abstract
Protein Biotechnology is an exciting and fast- growing area of research, with numerous industrial applications. The growing demand for developing efficient and rapid protein purification methods is driving research and growth in this area. Advances and progress in the techniques and methods of protein purification have been such that one can reasonably expect that any protein of a given order of stability may be purified to currently acceptable standards of homogeneity. However, protein manufacturing cost remains extremely high, with downstream processing constituting a substantial proportion of the overall cost. Understanding of the methods and optimization of the experimental conditions have become critical to the manufacturing industry in order to minimize production costs while satisfying the quality as well as all regulatory requirements. New purification processes exploiting specific, effective and robust methods and chromatographic materials are expected to guide the future of the protein purification market.
Collapse
|
4
|
Verheggen K, Raeder H, Berven FS, Martens L, Barsnes H, Vaudel M. Anatomy and evolution of database search engines-a central component of mass spectrometry based proteomic workflows. MASS SPECTROMETRY REVIEWS 2020; 39:292-306. [PMID: 28902424 DOI: 10.1002/mas.21543] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 07/05/2017] [Indexed: 06/07/2023]
Abstract
Sequence database search engines are bioinformatics algorithms that identify peptides from tandem mass spectra using a reference protein sequence database. Two decades of development, notably driven by advances in mass spectrometry, have provided scientists with more than 30 published search engines, each with its own properties. In this review, we present the common paradigm behind the different implementations, and its limitations for modern mass spectrometry datasets. We also detail how the search engines attempt to alleviate these limitations, and provide an overview of the different software frameworks available to the researcher. Finally, we highlight alternative approaches for the identification of proteomic mass spectrometry datasets, either as a replacement for, or as a complement to, sequence database search engines.
Collapse
Affiliation(s)
- Kenneth Verheggen
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Helge Raeder
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway
- Department of Pediatrics, Haukeland University Hospital, Bergen, Norway
| | - Frode S Berven
- Proteomics Unit, Department of Biomedicine, University of Bergen, Norway
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Harald Barsnes
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway
- Proteomics Unit, Department of Biomedicine, University of Bergen, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Norway
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Norway
- Proteomics Unit, Department of Biomedicine, University of Bergen, Norway
- Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| |
Collapse
|
5
|
Murtaza N, Uy J, Singh KK. Emerging proteomic approaches to identify the underlying pathophysiology of neurodevelopmental and neurodegenerative disorders. Mol Autism 2020; 11:27. [PMID: 32317014 PMCID: PMC7171839 DOI: 10.1186/s13229-020-00334-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 04/06/2020] [Indexed: 12/18/2022] Open
Abstract
Proteomics is the large-scale study of the total protein content and their overall function within a cell through multiple facets of research. Advancements in proteomic methods have moved past the simple quantification of proteins to the identification of post-translational modifications (PTMs) and the ability to probe interactions between these proteins, spatially and temporally. Increased sensitivity and resolution of mass spectrometers and sample preparation protocols have drastically reduced the large amount of cells required and the experimental variability that had previously hindered its use in studying human neurological disorders. Proteomics offers a new perspective to study the altered molecular pathways and networks that are associated with autism spectrum disorders (ASD). The differences between the transcriptome and proteome, combined with the various types of post-translation modifications that regulate protein function and localization, highlight a novel level of research that has not been appropriately investigated. In this review, we will discuss strategies using proteomics to study ASD and other neurological disorders, with a focus on how these approaches can be combined with induced pluripotent stem cell (iPSC) studies. Proteomic analysis of iPSC-derived neurons have already been used to measure changes in the proteome caused by patient mutations, analyze changes in PTMs that resulted in altered biological pathways, and identify potential biomarkers. Further advancements in both proteomic techniques and human iPSC differentiation protocols will continue to push the field towards better understanding ASD disease pathophysiology. Proteomics using iPSC-derived neurons from individuals with ASD offers a window for observing the altered proteome, which is necessary in the future development of therapeutics against specific targets.
Collapse
Affiliation(s)
- Nadeem Murtaza
- Stem Cell and Cancer Research Institute, Department of Biochemistry and Biomedical Sciences, McMaster University, 1280 Main Street West, Hamilton, ON, L8N 3Z5, Canada
| | - Jarryll Uy
- Stem Cell and Cancer Research Institute, Department of Biochemistry and Biomedical Sciences, McMaster University, 1280 Main Street West, Hamilton, ON, L8N 3Z5, Canada
| | - Karun K Singh
- Stem Cell and Cancer Research Institute, Department of Biochemistry and Biomedical Sciences, McMaster University, 1280 Main Street West, Hamilton, ON, L8N 3Z5, Canada.
| |
Collapse
|
6
|
Moritz CP, Paul S, Stoevesandt O, Tholance Y, Camdessanché JP, Antoine JC. Autoantigenomics: Holistic characterization of autoantigen repertoires for a better understanding of autoimmune diseases. Autoimmun Rev 2020; 19:102450. [PMID: 31838165 DOI: 10.1016/j.autrev.2019.102450] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Accepted: 10/16/2019] [Indexed: 12/13/2022]
|
7
|
Muth T, Renard BY. Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification? Brief Bioinform 2019; 19:954-970. [PMID: 28369237 DOI: 10.1093/bib/bbx033] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Indexed: 01/24/2023] Open
Abstract
While peptide identifications in mass spectrometry (MS)-based shotgun proteomics are mostly obtained using database search methods, high-resolution spectrum data from modern MS instruments nowadays offer the prospect of improving the performance of computational de novo peptide sequencing. The major benefit of de novo sequencing is that it does not require a reference database to deduce full-length or partial tag-based peptide sequences directly from experimental tandem mass spectrometry spectra. Although various algorithms have been developed for automated de novo sequencing, the prediction accuracy of proposed solutions has been rarely evaluated in independent benchmarking studies. The main objective of this work is to provide a detailed evaluation on the performance of de novo sequencing algorithms on high-resolution data. For this purpose, we processed four experimental data sets acquired from different instrument types from collision-induced dissociation and higher energy collisional dissociation (HCD) fragmentation mode using the software packages Novor, PEAKS and PepNovo. Moreover, the accuracy of these algorithms is also tested on ground truth data based on simulated spectra generated from peak intensity prediction software. We found that Novor shows the overall best performance compared with PEAKS and PepNovo with respect to the accuracy of correct full peptide, tag-based and single-residue predictions. In addition, the same tool outpaced the commercial competitor PEAKS in terms of running time speedup by factors of around 12-17. Despite around 35% prediction accuracy for complete peptide sequences on HCD data sets, taken as a whole, the evaluated algorithms perform moderately on experimental data but show a significantly better performance on simulated data (up to 84% accuracy). Further, we describe the most frequently occurring de novo sequencing errors and evaluate the influence of missing fragment ion peaks and spectral noise on the accuracy. Finally, we discuss the potential of de novo sequencing for now becoming more widely used in the field.
Collapse
Affiliation(s)
- Thilo Muth
- Research Group Bioinformatics, Robert Koch Institute, Berlin, Germany
| | - Bernhard Y Renard
- Research Group Bioinformatics, Robert Koch Institute, Berlin, Germany
| |
Collapse
|
8
|
Hernandez-Valladares M, Vaudel M, Selheim F, Berven F, Bruserud Ø. Proteogenomics approaches for studying cancer biology and their potential in the identification of acute myeloid leukemia biomarkers. Expert Rev Proteomics 2017; 14:649-663. [DOI: 10.1080/14789450.2017.1352474] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- Maria Hernandez-Valladares
- Department of Clinical Science, Faculty of Medicine and Dentistry, University of Bergen, Bergen, Norway
- Proteomics Unit, Department of Biomedicine, Faculty of Medicine and Dentistry, University of Bergen, Bergen, Norway
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, Faculty of Medicine and Dentistry, University of Bergen, Bergen, Norway
- Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| | - Frode Selheim
- Proteomics Unit, Department of Biomedicine, Faculty of Medicine and Dentistry, University of Bergen, Bergen, Norway
| | - Frode Berven
- Proteomics Unit, Department of Biomedicine, Faculty of Medicine and Dentistry, University of Bergen, Bergen, Norway
| | - Øystein Bruserud
- Department of Clinical Science, Faculty of Medicine and Dentistry, University of Bergen, Bergen, Norway
| |
Collapse
|
9
|
Jelonek K, Pietrowska M, Widlak P. Systemic effects of ionizing radiation at the proteome and metabolome levels in the blood of cancer patients treated with radiotherapy: the influence of inflammation and radiation toxicity. Int J Radiat Biol 2017; 93:683-696. [PMID: 28281355 DOI: 10.1080/09553002.2017.1304590] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
PURPOSE Blood is the most common replacement tissue used to study systemic responses of organisms to different types of pathological conditions and environmental insults. Local irradiation during cancer radiotherapy induces whole body responses that can be observed at the blood proteome and metabolome levels. Hence, comparative blood proteomics and metabolomics are emerging approaches used in the discovery of radiation biomarkers. These techniques enable the simultaneous measurement of hundreds of molecules and the identification of sets of components that can discriminate different physiological states of the human body. Radiation-induced changes are affected by the dose and volume of irradiated tissues; hence, the molecular composition of blood is a hypothetical source of biomarkers for dose assessment and the prediction and monitoring of systemic responses to radiation. This review aims to provide a comprehensive overview on the available evidence regarding molecular responses to ionizing radiation detected at the level of the human blood proteome and metabolome. It focuses on patients exposed to radiation during cancer radiotherapy and emphasizes effects related to radiation-induced toxicity and inflammation. CONCLUSIONS Systemic responses to radiation detected at the blood proteome and metabolome levels are primarily related to the intensity of radiation-induced toxicity, including inflammatory responses. Thus, several inflammation-associated molecules can be used to monitor or even predict radiation-induced toxicity. However, these abundant molecular features have a rather limited applicability as universal biomarkers for dose assessment, reflecting the individual predisposition of the immune system and tissue-specific mechanisms involved in radiation-induced damage.
Collapse
Affiliation(s)
- Karol Jelonek
- a Center for Translational Research and Molecular Biology of Cancer , Maria Sklodowska-Curie Institute - Oncology Center Gliwice Branch , Gliwice , Poland
| | - Monika Pietrowska
- a Center for Translational Research and Molecular Biology of Cancer , Maria Sklodowska-Curie Institute - Oncology Center Gliwice Branch , Gliwice , Poland
| | - Piotr Widlak
- a Center for Translational Research and Molecular Biology of Cancer , Maria Sklodowska-Curie Institute - Oncology Center Gliwice Branch , Gliwice , Poland
| |
Collapse
|
10
|
Abstract
Scoring functions that assess spectrum similarity play a crucial role in many computational mass spectrometry algorithms. These functions are used to compare an experimentally acquired fragmentation (MS/MS) spectrum against two different types of target MS/MS spectra: either against a theoretical MS/MS spectrum derived from a peptide from a sequence database, or against another, previously acquired MS/MS spectrum. The former is typically encountered in database searching, while the latter is used in spectrum clustering and spectral library searching. The comparison between acquired versus theoretical MS/MS spectra is most commonly performed using cross-correlations or probability derived scoring functions, while the comparison of two acquired MS/MS spectra typically makes use of a normalized dot product, especially in spectrum library search algorithms. In addition to these scoring functions, Pearson's or Spearman's correlation coefficients, mean squared error, or median absolute deviation scores can also be used for the same purpose. Here, we describe and evaluate these scoring functions with regards to their ability to assess spectrum similarity for theoretical versus acquired, and acquired versus acquired spectra.
Collapse
Affiliation(s)
- Şule Yilmaz
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent, 9000, Belgium
- Department of Biochemistry, Ghent University, Ghent, 9000, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, 9000, Belgium
| | - Elien Vandermarliere
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent, 9000, Belgium
- Department of Biochemistry, Ghent University, Ghent, 9000, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, 9000, Belgium
| | - Lennart Martens
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent, 9000, Belgium.
- Department of Biochemistry, Ghent University, Ghent, 9000, Belgium.
- Bioinformatics Institute Ghent, Ghent University, Ghent, 9000, Belgium.
| |
Collapse
|
11
|
Muth T, Renard BY, Martens L. Metaproteomic data analysis at a glance: advances in computational microbial community proteomics. Expert Rev Proteomics 2016; 13:757-69. [DOI: 10.1080/14789450.2016.1209418] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
12
|
Yılmaz Ş, Victor B, Hulstaert N, Vandermarliere E, Barsnes H, Degroeve S, Gupta S, Sticker A, Gabriël S, Dorny P, Palmblad M, Martens L. A Pipeline for Differential Proteomics in Unsequenced Species. J Proteome Res 2016; 15:1963-70. [DOI: 10.1021/acs.jproteome.6b00140] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Şule Yılmaz
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Bjorn Victor
- Veterinary
Helminthology Unit, Department of Biomedical Sciences, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Niels Hulstaert
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Elien Vandermarliere
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Harald Barsnes
- Proteomics
Unit (PROBE), Department of Biomedicine, University of Bergen, Jonas Liesvei 91, N-5009 Bergen, Norway
| | - Sven Degroeve
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Surya Gupta
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Adriaan Sticker
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
- Department
of Applied Mathematics, Computer Science, and Statistics, Ghent University, B-9000 Ghent, Belgium
| | - Sarah Gabriël
- Veterinary
Helminthology Unit, Department of Biomedical Sciences, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Pierre Dorny
- Veterinary
Helminthology Unit, Department of Biomedical Sciences, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Magnus Palmblad
- Center
for Proteomics and Metabolomics, Leiden University Medical Center, 2300 RC Leiden, The Netherlands
| | - Lennart Martens
- Medical Biotechnology Center, VIB, Albert Baertsoenkaai 3, Ghent B-9000, Belgium
- Department
of Biochemistry, Ghent University, Albert Baertsoenkaai 3, B-9000 Ghent, Belgium
- Bioinformatics
Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| |
Collapse
|
13
|
Martens L. Public proteomics data: How the field has evolved from sceptical inquiry to the promise of in silico proteomics. EUPA OPEN PROTEOMICS 2016; 11:42-44. [PMID: 29900110 PMCID: PMC5988554 DOI: 10.1016/j.euprot.2016.02.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2015] [Revised: 02/13/2016] [Accepted: 02/15/2016] [Indexed: 12/23/2022]
Abstract
Proteomics data sharing moved from validation to re-use. New tools and services make data very easily accessible. Metadata provision can still benefit from improvements. Quality control metrics will soon be reported along with submitted data. Data re-use will enable the advent of actual in silico proteomics.
Collapse
Affiliation(s)
- Lennart Martens
- Department of Medical Protein Research, VIB 9000 Ghent, Belgium.,Department of Biochemistry, Ghent University, 9000 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
14
|
Guldbrandsen A, Barsnes H, Kroksveen AC, Berven FS, Vaudel M. A Simple Workflow for Large Scale Shotgun Glycoproteomics. Methods Mol Biol 2016; 1394:275-286. [PMID: 26700056 DOI: 10.1007/978-1-4939-3341-9_20] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Targeting subproteomes is a good strategy to decrease the complexity of a sample, for example in body fluid biomarker studies. Glycoproteins are proteins with carbohydrates of varying size and structure attached to the polypeptide chain, and it has been shown that glycosylation plays essential roles in several vital cellular processes, making glycosylation a particularly interesting field of study. Here, we describe a method for the enrichment of glycosylated peptides from trypsin digested proteins in human cerebrospinal fluid. We also describe how to perform the data analysis on the mass spectrometry data for such samples, focusing on site-specific identification of glycosylation sites, using user friendly open source software.
Collapse
Affiliation(s)
- Astrid Guldbrandsen
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, Bergen, Norway
| | - Harald Barsnes
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- KG Jebsen Center for Diabetes Research, Department of Clinical Sciences, University of Bergen, Bergen, Norway
| | - Ann Cathrine Kroksveen
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, Bergen, Norway
| | - Frode S Berven
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, Bergen, Norway
- Norwegian Multiple Sclerosis Competence Centre, Department of Neurology, Haukeland University Hospital, Bergen, Norway
| | - Marc Vaudel
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway.
| |
Collapse
|
15
|
Vaudel M, Verheggen K, Csordas A, Raeder H, Berven FS, Martens L, Vizcaíno JA, Barsnes H. Exploring the potential of public proteomics data. Proteomics 2016; 16:214-25. [PMID: 26449181 PMCID: PMC4738454 DOI: 10.1002/pmic.201500295] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Revised: 08/25/2015] [Accepted: 09/28/2015] [Indexed: 12/22/2022]
Abstract
In a global effort for scientific transparency, it has become feasible and good practice to share experimental data supporting novel findings. Consequently, the amount of publicly available MS-based proteomics data has grown substantially in recent years. With some notable exceptions, this extensive material has however largely been left untouched. The time has now come for the proteomics community to utilize this potential gold mine for new discoveries, and uncover its untapped potential. In this review, we provide a brief history of the sharing of proteomics data, showing ways in which publicly available proteomics data are already being (re-)used, and outline potential future opportunities based on four different usage types: use, reuse, reprocess, and repurpose. We thus aim to assist the proteomics community in stepping up to the challenge, and to make the most of the rapidly increasing amount of public proteomics data.
Collapse
Affiliation(s)
- Marc Vaudel
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
| | - Kenneth Verheggen
- Medical Biotechnology Center, VIB, Ghent, Belgium
- Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Attila Csordas
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Helge Raeder
- Department of Clinical Science, KG Jebsen Center for Diabetes Research, University of Bergen, Bergen, Norway
| | - Frode S Berven
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- Department of Clinical Medicine, KG Jebsen Centre for Multiple Sclerosis Research, University of Bergen, Bergen, Norway
| | - Lennart Martens
- Medical Biotechnology Center, VIB, Ghent, Belgium
- Department of Biochemistry, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Juan A Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Harald Barsnes
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- Department of Clinical Science, KG Jebsen Center for Diabetes Research, University of Bergen, Bergen, Norway
| |
Collapse
|
16
|
Lereim RR, Oveland E, Berven FS, Vaudel M, Barsnes H. Visualization, Inspection and Interpretation of Shotgun Proteomics Identification Results. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 919:227-235. [DOI: 10.1007/978-3-319-41448-5_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
17
|
Vaudel M, Barsnes H, Ræder H, Berven FS. Using Proteomics Bioinformatics Tools and Resources in Proteogenomic Studies. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 926:65-75. [DOI: 10.1007/978-3-319-42316-6_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
18
|
Virus ED, Ivanov AV, Luzyanin BP, Kubatiev AA. Some aspects of experimental design in targeted proteomics based on the use of selected reaction monitoring and isotope-labeled peptides. JOURNAL OF ANALYTICAL CHEMISTRY 2015. [DOI: 10.1134/s1061934815130109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
19
|
Bari MG, Ramirez N, Wang Z, Zhang J(M. MZDASoft: a software architecture that enables large-scale comparison of protein expression levels over multiple samples based on liquid chromatography/tandem mass spectrometry. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2015; 29:1841-1848. [PMID: 26331936 PMCID: PMC4560111 DOI: 10.1002/rcm.7272] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Revised: 06/22/2015] [Accepted: 07/04/2015] [Indexed: 06/05/2023]
Abstract
RATIONALE Without accurate peak linking/alignment, only the expression levels of a small percentage of proteins can be compared across multiple samples in Liquid Chromatography/Mass Spectrometry/Tandem Mass Spectrometry (LC/MS/MS) due to the selective nature of tandem MS peptide identification. This greatly hampers biomedical research that aims at finding biomarkers for disease diagnosis, treatment, and the understanding of disease mechanisms. A recent algorithm, PeakLink, has allowed the accurate linking of LC/MS peaks without tandem MS identifications to their corresponding ones with identifications across multiple samples collected from different instruments, tissues and labs, which greatly enhanced the ability of comparing proteins. However, PeakLink cannot be implemented practically for large numbers of samples based on existing software architectures, because it requires access to peak elution profiles from multiple LC/MS/MS samples simultaneously. METHODS We propose a new architecture based on parallel processing, which extracts LC/MS peak features, and saves them in database files to enable the implementation of PeakLink for multiple samples. The software has been deployed in High-Performance Computing (HPC) environments. The core part of the software, MZDASoft Parallel Peak Extractor (PPE), can be downloaded with a user and developer's guide, and it can be run on HPC centers directly. The quantification applications, MZDASoft TandemQuant and MZDASoft PeakLink, are written in Matlab, which are compiled with a Matlab runtime compiler. A sample script that incorporates all necessary processing steps of MZDASoft for LC/MS/MS quantification in a parallel processing environment is available. The project webpage is http://compgenomics.utsa.edu/zgroup/MZDASoft. RESULTS The proposed architecture enables the implementation of PeakLink for multiple samples. Significantly more (100%-500%) proteins can be compared over multiple samples with better quantification accuracy in test cases. CONCLUSION MZDASoft enables large-scale comparison of protein expression levels over multiple samples with much larger protein comparison coverage and better quantification accuracy. It is an efficient implementation based on parallel processing which can be used to process large amounts of data.
Collapse
Affiliation(s)
- Mehrab Ghanat Bari
- Department of Electrical and Computer Engineering, Univ. of Texas at San Antonio, One UTSA Circle, San Antonio, TX 782
| | - Nelson Ramirez
- Computational Biology Initiative, Univ. of Texas at San Antonio, One UTSA Circle, San Antonio, TX 78249
| | - Zhiwei Wang
- Computational Biology Initiative, Univ. of Texas at San Antonio, One UTSA Circle, San Antonio, TX 78249
| | - Jianqiu (Michelle) Zhang
- Department of Electrical and Computer Engineering, Univ. of Texas at San Antonio, One UTSA Circle, San Antonio, TX 782
| |
Collapse
|
20
|
Shah DJ, Rohlfing F, Anand S, Johnson WE, Alvarez MTB, Cobell J, King J, Young SA, Kauwe JS, Graves SW. Discovery and Subsequent Confirmation of Novel Serum Biomarkers Diagnosing Alzheimer’s Disease. J Alzheimers Dis 2015; 49:317-27. [DOI: 10.3233/jad-150498] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Dipti Jigar Shah
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | | | - Swati Anand
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - W. Evan Johnson
- Division of Computational Biomedicine, Boston University School of Medicine, Boston University, Boston, MA, USA
| | | | - Jesse Cobell
- Department of Biology, Brigham Young University, Provo, UT, USA
| | - Jackson King
- Department of Biology, Brigham Young University, Provo, UT, USA
| | - Sydney A. Young
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - John S.K. Kauwe
- Department of Biology, Brigham Young University, Provo, UT, USA
| | - Steven W. Graves
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| |
Collapse
|
21
|
Abramowicz A, Wojakowska A, Gdowicz-Klosok A, Polanska J, Rodziewicz P, Polanowski P, Namysl-Kaletka A, Pietrowska M, Wydmanski J, Widlak P. Identification of serum proteome signatures of locally advanced and metastatic gastric cancer: a pilot study. J Transl Med 2015; 13:304. [PMID: 26376850 PMCID: PMC4574216 DOI: 10.1186/s12967-015-0668-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2015] [Accepted: 09/10/2015] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The gastric cancer is one of the most common and mortal cancer worldwide. The initial asymptomatic development and further nonspecific symptoms result in diagnosis at the advanced stage with poor prognosis. Yet, no clinically useful biomarkers are available for this malignancy, and invasive gastrointestinal endoscopy remains the only reliable option at the moment. Hence, there is a need for discovery of clinically useful noninvasive diagnostic and/or prognostic tool as an alternative (or complement) for current diagnostic tools. Here we aimed to search for serum proteins characteristic for local and invasive gastric cancer. METHODS Pre-treatment blood samples were collected from patients with diagnosed gastric adenocarcinoma at the different stage of disease: 35 patients with locally advanced cancer and 18 patients with metastatic cancer; 50 healthy donors were also included as a control group. The low-molecular-weight fraction of serum proteome (i.e., endogenous peptidome) was profiled by the MALDI-ToF mass spectrometry, and the whole proteome components were identified and quantified by the LC-MS/MS shotgun approach. RESULTS Multicomponent peptidome signatures were revealed that allowed good discrimination between healthy controls and cancer patients, as well as between patients with locally advanced and metastatic cancer. Moreover, a LC-MS/MS approach revealed 49 serum proteins with different abundances between healthy donors and cancer patients (predominantly proteins associated with inflammation and acute phase response). Furthermore, 19 serum proteins with different abundances between patients with locally advanced and metastatic cancer were identified (including proteins associated with cytokine/chemokine response and metabolism of nucleic acids). However, neither peptidome profiling nor shotgun proteomics approach allowed detecting serum components discriminating between two subgroups of patients with local disease who either developed or did not develop metastases during follow-up. CONCLUSIONS The molecular differences between locally advanced and metastatic gastric cancer, as well as more obvious differences between healthy individuals and cancer patients, have marked reflection at the level of serum proteome. However, we have no evidence that features of pre-treatment serum proteome could predict a risk of cancer dissemination in patients treated due to local disease. Nevertheless, presented data confirmed potential applicability of a serum proteome signature-based biomarker in diagnostics of gastric cancer.
Collapse
Affiliation(s)
- Agata Abramowicz
- Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Gliwice, Poland.
| | - Anna Wojakowska
- Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Gliwice, Poland.
| | | | | | - Pawel Rodziewicz
- Institute of Bioorganic Chemistry, Polish Academy of Science, Poznan, Poland.
| | - Pawel Polanowski
- Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Gliwice, Poland.
| | | | - Monika Pietrowska
- Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Gliwice, Poland.
| | - Jerzy Wydmanski
- Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Gliwice, Poland.
| | - Piotr Widlak
- Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Gliwice, Poland.
| |
Collapse
|
22
|
Widlak P, Jelonek K, Wojakowska A, Pietrowska M, Polanska J, Marczak Ł, Miszczyk L, Składowski K. Serum Proteome Signature of Radiation Response: Upregulation of Inflammation-Related Factors and Downregulation of Apolipoproteins and Coagulation Factors in Cancer Patients Treated With Radiation Therapy—A Pilot Study. Int J Radiat Oncol Biol Phys 2015; 92:1108-1115. [DOI: 10.1016/j.ijrobp.2015.03.040] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Revised: 03/26/2015] [Accepted: 03/30/2015] [Indexed: 01/03/2023]
|
23
|
Goeminne LJE, Argentini A, Martens L, Clement L. Summarization vs Peptide-Based Models in Label-Free Quantitative Proteomics: Performance, Pitfalls, and Data Analysis Guidelines. J Proteome Res 2015; 14:2457-65. [PMID: 25827922 DOI: 10.1021/pr501223t] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Quantitative label-free mass spectrometry is increasingly used to analyze the proteomes of complex biological samples. However, the choice of appropriate data analysis methods remains a major challenge. We therefore provide a rigorous comparison between peptide-based models and peptide-summarization-based pipelines. We show that peptide-based models outperform summarization-based pipelines in terms of sensitivity, specificity, accuracy, and precision. We also demonstrate that the predefined FDR cutoffs for the detection of differentially regulated proteins can become problematic when differentially expressed (DE) proteins are highly abundant in one or more samples. Care should therefore be taken when data are interpreted from samples with spiked-in internal controls and from samples that contain a few very highly abundant proteins. We do, however, show that specific diagnostic plots can be used for assessing differentially expressed proteins and the overall quality of the obtained fold change estimates. Finally, our study also illustrates that imputation under the "missing by low abundance" assumption is beneficial for the detection of differential expression in proteins with low abundance, but it negatively affects moderately to highly abundant proteins. Hence, imputation strategies that are commonly implemented in standard proteomics software should be used with care.
Collapse
Affiliation(s)
- Ludger J E Goeminne
- ∥Department of Plant Systems Biology, VIB, Ghent University, 9052 Ghent, Belgium
| | | | | | | |
Collapse
|
24
|
Muth T, Kolmeder CA, Salojärvi J, Keskitalo S, Varjosalo M, Verdam FJ, Rensen SS, Reichl U, de Vos WM, Rapp E, Martens L. Navigating through metaproteomics data: a logbook of database searching. Proteomics 2015; 15:3439-53. [PMID: 25778831 DOI: 10.1002/pmic.201400560] [Citation(s) in RCA: 91] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Revised: 02/13/2015] [Accepted: 03/06/2015] [Indexed: 11/12/2022]
Abstract
Metaproteomic research involves various computational challenges during the identification of fragmentation spectra acquired from the proteome of a complex microbiome. These issues are manifold and range from the construction of customized sequence databases, the optimal setting of search parameters to limitations in the identification search algorithms themselves. In order to assess the importance of these individual factors, we studied the effect of strategies to combine different search algorithms, explored the influence of chosen database search settings, and investigated the impact of the size of the protein sequence database used for identification. Furthermore, we applied de novo sequencing as a complementary approach to classic database searching. All evaluations were performed on a human intestinal metaproteome dataset. Pyrococcus furiosus proteome data were used to contrast database searching of metaproteomic data to a classic proteomic experiment. Searching against subsets of metaproteome databases and the use of multiple search engines increased the number of identifications. The integration of P. furiosus sequences in a metaproteomic sequence database showcased the limitation of the target-decoy-controlled false discovery rate approach in combination with large sequence databases. The selection of varying search engine parameters and the application of de novo sequencing represented useful methods to increase the reliability of the results. Based on our findings, we provide recommendations for the data analysis that help researchers to establish or improve analysis workflows in metaproteomics.
Collapse
Affiliation(s)
- Thilo Muth
- Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
| | - Carolin A Kolmeder
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Jarkko Salojärvi
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland
| | - Salla Keskitalo
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Markku Varjosalo
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Froukje J Verdam
- Department of General Surgery, NUTRIM, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Sander S Rensen
- Department of General Surgery, NUTRIM, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Udo Reichl
- Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany.,Otto-von-Guericke University, Bioprocess Engineering, Magdeburg, Germany
| | - Willem M de Vos
- Department of Veterinary Biosciences, University of Helsinki, Helsinki, Finland.,Department of Bacteriology and Immunology, University of Helsinki, Helsinki, Finland.,Laboratory of Microbiology, Wageningen University, Wageningen, The Netherlands
| | - Erdmann Rapp
- Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
| | - Lennart Martens
- Department of Biochemistry, Ghent University, Ghent, Belgium.,Department of Medical Protein Research, VIB, Ghent, Belgium
| |
Collapse
|
25
|
Hattan SJ, Du J, Parker KC. Bifunctional Glass Membrane Designed to Interface SDS-PAGE Separations of Proteins with the Detection of Peptides by Mass Spectrometry. Anal Chem 2015; 87:3685-93. [DOI: 10.1021/ac503980x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Stephen J. Hattan
- SimulTOF Systems, 60 Union Avenue, Sudbury, Massachusetts 01776, United States
| | - Jie Du
- Toxikon Corporation, 15 Wiggins Avenue, Bedford, Massachusetts 01730, United States
| | - Kenneth C. Parker
- SimulTOF Systems, 60 Union Avenue, Sudbury, Massachusetts 01776, United States
| |
Collapse
|
26
|
Oveland E, Muth T, Rapp E, Martens L, Berven FS, Barsnes H. Viewing the proteome: how to visualize proteomics data? Proteomics 2015; 15:1341-55. [PMID: 25504833 DOI: 10.1002/pmic.201400412] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Revised: 10/23/2014] [Accepted: 12/05/2014] [Indexed: 01/18/2023]
Abstract
Proteomics has become one of the main approaches for analyzing and understanding biological systems. Yet similar to other high-throughput analysis methods, the presentation of the large amounts of obtained data in easily interpretable ways remains challenging. In this review, we present an overview of the different ways in which proteomics software supports the visualization and interpretation of proteomics data. The unique challenges and current solutions for visualizing the different aspects of proteomics data, from acquired spectra via protein identification and quantification to pathway analysis, are discussed, and examples of the most useful visualization approaches are highlighted. Finally, we offer our ideas about future directions for proteomics data visualization.
Collapse
Affiliation(s)
- Eystein Oveland
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway; KG Jebsen Centre for Multiple Sclerosis Research, Department of Clinical Medicine, University of Bergen, Bergen, Norway; Department of Clinical Medicine, University of Bergen, Bergen, Norway
| | | | | | | | | | | |
Collapse
|
27
|
Volders PJ, Verheggen K, Menschaert G, Vandepoele K, Martens L, Vandesompele J, Mestdagh P. An update on LNCipedia: a database for annotated human lncRNA sequences. Nucleic Acids Res 2014; 43:D174-80. [PMID: 25378313 PMCID: PMC4383901 DOI: 10.1093/nar/gku1060] [Citation(s) in RCA: 212] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
The human genome is pervasively transcribed, producing thousands of non-coding RNA transcripts. The majority of these transcripts are long non-coding RNAs (lncRNAs) and novel lncRNA genes are being identified at rapid pace. To streamline these efforts, we created LNCipedia, an online repository of lncRNA transcripts and annotation. Here, we present LNCipedia 3.0 (http://www.lncipedia.org), the latest version of the publicly available human lncRNA database. Compared to the previous version of LNCipedia, the database grew over five times in size, gaining over 90,000 new lncRNA transcripts. Assessment of the protein-coding potential of LNCipedia entries is improved with state-of-the art methods that include large-scale reprocessing of publicly available proteomics data. As a result, a high-confidence set of lncRNA transcripts with low coding potential is defined and made available for download. In addition, a tool to assess lncRNA gene conservation between human, mouse and zebrafish has been implemented.
Collapse
Affiliation(s)
| | - Kenneth Verheggen
- Department of Medical Protein Research, VIB, Ghent 9000, Belgium Department of Biochemistry, Ghent University, Ghent 9000 Belgium
| | - Gerben Menschaert
- Department of Mathematical Modelling, Statistics and Bioinformatics, Ghent University, Ghent 9000, Belgium
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent 9000, Belgium Department of Plant Systems Biology, VIB, Ghent 9000, Belgium
| | - Lennart Martens
- Department of Medical Protein Research, VIB, Ghent 9000, Belgium Department of Biochemistry, Ghent University, Ghent 9000 Belgium
| | - Jo Vandesompele
- Center for Medical Genetics, Ghent University, Ghent 9000, Belgium
| | - Pieter Mestdagh
- Center for Medical Genetics, Ghent University, Ghent 9000, Belgium
| |
Collapse
|
28
|
Abstract
PURPOSE OF REVIEW By determining metalloproteomes via high-throughput methodology, metalloproteomics provides a research strategy for investigating nutritional and metabolic issues relating to metals. In this review, we examine recent developments in metalloproteomics since its early days approximately 12 years ago, when we utilized metalloproteomics to investigate copper disposition in hepatocytes in relation to Wilson disease. RECENT FINDINGS A metalloproteome is the set of proteins that have metal-binding capacity by being metalloproteins or manifesting metal-binding sites. Like all proteomes, a metalloproteome is determined within the context of a well defined system. It can be ascertained for a single metal or multiple metals in that system. Apart from major technological advances in analytical techniques, recent work has examined metalloproteomes for metals other than copper, notably nickel, zinc and manganese. Given the importance of microbiomes to metabolism, microbial metalloproteomics is a rapidly expanding and promising new field. SUMMARY Metals play key roles in metabolic processes. Sufficient technological progress has taken place in the past decade to make metalloproteomics an exciting and innovative type of research in nutrition and metabolism. It elucidates how metals contribute to metabolic physiology across the phyla, including in microbes. For humans, it may clarify mechanisms as well as identify informative diagnostic or prognostic biomarkers.
Collapse
Affiliation(s)
- Eve A Roberts
- aDivision of Gastroenterology, Hepatology and Nutrition, The Hospital for Sick Children bGenetics and Genome Biology Program cMolecular Structure and Function Program, The Hospital for Sick Children Research Institute dDepartments of Paediatrics eMedicine fPharmacology gBiochemistry, University of Toronto, Toronto, Ontario, Canada
| | | |
Collapse
|
29
|
Effects of L-theanine on posttraumatic stress disorder induced changes in rat brain gene expression. ScientificWorldJournal 2014; 2014:419032. [PMID: 25165739 PMCID: PMC4137547 DOI: 10.1155/2014/419032] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2014] [Revised: 04/11/2014] [Accepted: 05/13/2014] [Indexed: 01/02/2023] Open
Abstract
Posttraumatic stress disorder (PTSD) is characterized by the occurrence of a traumatic event that is beyond the normal range of human experience. The future of PTSD treatment may specifically target the molecular mechanisms of PTSD. In the US, approximately 20% of adults report taking herbal products to treat medical illnesses. L-theanine is the amino acid in green tea primarily responsible for relaxation effects. No studies have evaluated the potential therapeutic properties of herbal medications on gene expression in PTSD. We evaluated gene expression in PTSD-induced changes in the amygdala and hippocampus of Sprague-Dawley rats. The rats were assigned to PTSD-stressed and nonstressed groups that received either saline, midazolam, L-theanine, or L-theanine + midazolam. Amygdala and hippocampus tissue samples were analyzed for changes in gene expression. One-way ANOVA was used to detect significant difference between groups in the amygdala and hippocampus. Of 88 genes examined, 17 had a large effect size greater than 0.138. Of these, 3 genes in the hippocampus and 5 genes in the amygdala were considered significant (P < 0.05) between the groups. RT-PCR analysis revealed significant changes between groups in several genes implicated in a variety of disorders ranging from PTSD, anxiety, mood disorders, and substance dependence.
Collapse
|
30
|
Shen X, Young R, Canty JM, Qu J. Quantitative proteomics in cardiovascular research: global and targeted strategies. Proteomics Clin Appl 2014; 8:488-505. [PMID: 24920501 DOI: 10.1002/prca.201400014] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/02/2014] [Accepted: 06/06/2014] [Indexed: 11/05/2022]
Abstract
Extensive technical advances in the past decade have substantially expanded quantitative proteomics in cardiovascular research. This has great promise for elucidating the mechanisms of cardiovascular diseases and the discovery of cardiac biomarkers used for diagnosis and treatment evaluation. Global and targeted proteomics are the two major avenues of quantitative proteomics. While global approaches enable unbiased discovery of altered proteins via relative quantification at the proteome level, targeted techniques provide higher sensitivity and accuracy, and are capable of multiplexed absolute quantification in numerous clinical/biological samples. While promising, technical challenges need to be overcome to enable full utilization of these techniques in cardiovascular medicine. Here, we discuss recent advances in quantitative proteomics and summarize applications in cardiovascular research with an emphasis on biomarker discovery and elucidating molecular mechanisms of disease. We propose the integration of global and targeted strategies as a high-throughput pipeline for cardiovascular proteomics. Targeted approaches enable rapid, extensive validation of biomarker candidates discovered by global proteomics. These approaches provide a promising alternative to immunoassays and other low-throughput means currently used for limited validation.
Collapse
Affiliation(s)
- Xiaomeng Shen
- Department of Biochemistry, University at Buffalo, Buffalo, NY, USA; New York State Center of Excellence in Bioinformatics and Life Sciences, University at Buffalo, Buffalo, NY, USA
| | | | | | | |
Collapse
|
31
|
Gromov P, Moreira JMA, Gromova I. Proteomic analysis of tissue samples in translational breast cancer research. Expert Rev Proteomics 2014; 11:285-302. [DOI: 10.1586/14789450.2014.899469] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
32
|
Vaudel M, Venne AS, Berven FS, Zahedi RP, Martens L, Barsnes H. Shedding light on black boxes in protein identification. Proteomics 2014; 14:1001-5. [DOI: 10.1002/pmic.201300488] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2013] [Revised: 01/10/2014] [Accepted: 01/22/2014] [Indexed: 12/28/2022]
Affiliation(s)
- Marc Vaudel
- Proteomics Unit; Department of Biomedicine; University of Bergen; Norway
- Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V; Dortmund Germany
| | - A. Saskia Venne
- Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V; Dortmund Germany
| | - Frode S. Berven
- Proteomics Unit; Department of Biomedicine; University of Bergen; Norway
- Department of Clinical Medicine; The KG Jebsen Centre for MS-research; University of Bergen; Bergen Norway
- Department of Neurology; The Norwegian Multiple Sclerosis Competence Centre; Haukeland University Hospital; Bergen Norway
| | - René P. Zahedi
- Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V; Dortmund Germany
| | - Lennart Martens
- Department of Medical Protein Research; VIB; Ghent Belgium
- Department of Biochemistry; Ghent University; Ghent Belgium
| | - Harald Barsnes
- Proteomics Unit; Department of Biomedicine; University of Bergen; Norway
| |
Collapse
|
33
|
Ma X, Sickmann A, Pietsch J, Wildgruber R, Weber G, Infanger M, Bauer J, Grimm D. Proteomic differences between microvascular endothelial cells and the EA.hy926 cell line forming three-dimensional structures. Proteomics 2014; 14:689-98. [PMID: 24376074 DOI: 10.1002/pmic.201300453] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2013] [Revised: 11/28/2013] [Accepted: 12/10/2013] [Indexed: 12/12/2022]
Abstract
Proteomic changes of two types of human endothelial cells (ECs) were determined and compared to morphological alterations occurring during the scaffold-free in vitro formation of 3D structures resembling vascular intimas. The EA.hy926 cell line and human microvascular ECs (HMVECs) were cultured on a random positioning machine or static on ground (normal gravity) for 5 and 7 days, before their morphology was examined and their protein content was analysed by MS after free-flow electrophoretic separation. A total of 1175 types of proteins were found in EA.hy926 cells and 846 in HMVEC forming 3D structures faster than the EA.hy926 cells. Five hundred and eighty-four of these kinds of proteins were present in both types of cells. They included a number of metabolic enzymes, of structure-related and stress proteins. Comparing proteins of EA.hy926 cells growing either adherently on ground or in 3D aggregates on the random positioning machine revealed that ribosomal proteins were enhanced, while tubes are formed and various components of 26S proteasomes remained prevalent in static normal gravity control cells only. The fast developing tube-like 3D structures of HMVEC suggested a transient augmentation of ribosomal proteins during the 3D assembling of ECs.
Collapse
Affiliation(s)
- Xiao Ma
- Institute of Biomedicine, Pharmacology, Aarhus University, Aarhus, Denmark
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
Biological macromolecules such as proteins constitute an important class of products in the food, biotechnology, pharmaceutical, and cosmetics industries. The growing need to develop efficient and rapid protein purification methods is driving research and growth in this area. Advances and progress in the methods and techniques of protein purification have been such that one can reasonably expect that any protein of a given order of stability may be purified to currently acceptable standards of homogeneity. However, protein production cost remains extremely high, with downstream processing constituting a substantial proportion of the overall cost. Understanding of the methods and optimization of experimental conditions have become critical to the manufacturing industry in order to minimize production costs while satisfying all regulatory requirements. New purification protocols exploiting specific, effective, and robust methods and materials are expected to guide the future of the protein purification area.
Collapse
Affiliation(s)
- Nikolaos E Labrou
- Enzyme Technology Laboratory, Department of Biotechnology, Agricultural University of Athens, Iera Odos 75, 11855, Athens, Greece,
| |
Collapse
|
35
|
Verheggen K, Barsnes H, Martens L. Distributed computing and data storage in proteomics: many hands make light work, and a stronger memory. Proteomics 2013; 14:367-77. [PMID: 24285552 DOI: 10.1002/pmic.201300288] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Revised: 09/09/2013] [Accepted: 09/23/2013] [Indexed: 12/25/2022]
Abstract
Modern day proteomics generates ever more complex data, causing the requirements on the storage and processing of such data to outgrow the capacity of most desktop computers. To cope with the increased computational demands, distributed architectures have gained substantial popularity in the recent years. In this review, we provide an overview of the current techniques for distributed computing, along with examples of how the techniques are currently being employed in the field of proteomics. We thus underline the benefits of distributed computing in proteomics, while also pointing out the potential issues and pitfalls involved.
Collapse
Affiliation(s)
- Kenneth Verheggen
- Department of Medical Protein Research, VIB, Ghent, Belgium; Department of Biochemistry, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
| | | | | |
Collapse
|
36
|
Vandermarliere E, Mueller M, Martens L. Getting intimate with trypsin, the leading protease in proteomics. MASS SPECTROMETRY REVIEWS 2013; 32:453-65. [PMID: 23775586 DOI: 10.1002/mas.21376] [Citation(s) in RCA: 158] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 05/21/2023]
Abstract
Nowadays, mass spectrometry-based proteomics is carried out primarily in a bottom-up fashion, with peptides obtained after proteolytic digest of a whole proteome lysate as the primary analytes instead of the proteins themselves. This experimental setup crucially relies on a protease to digest an abundant and complex protein mixture into a far more complex peptide mixture. Full knowledge of the working mechanism and specificity of the used proteases is therefore crucial, both for the digestion step itself as well as for the downstream identification and quantification of the (fragmentation) mass spectra acquired for the peptides in the mixture. Targeted protein analysis through selected reaction monitoring, a relative newcomer in the specific field of mass spectrometry-based proteomics, even requires a priori understanding of protease behavior for the proteins of interest. Because of the rapidly increasing popularity of proteomics as an analytical tool in the life sciences, there is now a renewed demand for detailed knowledge on trypsin, the workhorse protease in proteomics. This review addresses this need and provides an overview on the structure and working mechanism of trypsin, followed by a critical analysis of its cleavage behavior, typically simply accepted to occur exclusively yet consistently after Arg and Lys, unless they are followed by a Pro. In this context, shortcomings in our ability to understand and predict the behavior of trypsin will be highlighted, along with the downstream implications. Furthermore, an analysis is carried out on the inherent shortcomings of trypsin with regard to whole proteome analysis, and alternative approaches will be presented that can alleviate these issues. Finally, some reflections on the future of trypsin as the workhorse protease in mass spectrometry-based proteomics will be provided.
Collapse
Affiliation(s)
- Elien Vandermarliere
- Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium; Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | | | | |
Collapse
|
37
|
Wildgruber R, Weber G, Wise P, Grimm D, Bauer J. Free-flow electrophoresis in proteome sample preparation. Proteomics 2013; 14:629-36. [PMID: 24123730 DOI: 10.1002/pmic.201300253] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2013] [Revised: 08/07/2013] [Accepted: 08/23/2013] [Indexed: 02/01/2023]
Abstract
An aim of proteome research is to identify the entire complement of proteins expressed in defined cell types of humans, animals, plants, and microorganisms. The approach requires searching for low abundant or even rarely expressed proteins in many cell types, as well as the determination of the protein expression levels in subcellular compartments and organelles. In recent years, rather powerful MS technologies have been developed. At this stage of MS device development, it is of highest interest to purify intact cell types or isolate subcellular compartments, where the proteins of interest are originating from, which determine the final composition of a peptide mixture. Free-flow electrophoresis proved to be useful to prepare meaningful peptide mixtures because of its improved capabilities in particle electrophoresis and the enhanced resolution in protein separation. Sample preparation by free-flow electrophoresis mediated particle separation was preferentially performed for purification of either organelles and their subspecies or major protein complexes. Especially, the introduction of isotachophoresis and interval zone electrophoresis improved the purity of the gained analytes of interest. In addition, free-flow IEF proved to be helpful, when proteins of low solubility, obtained, e.g. from cell membranes, were investigated.
Collapse
|
38
|
Muth T, Benndorf D, Reichl U, Rapp E, Martens L. Searching for a needle in a stack of needles: challenges in metaproteomics data analysis. MOLECULAR BIOSYSTEMS 2013; 9:578-85. [PMID: 23238088 DOI: 10.1039/c2mb25415h] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
In the past years the integral study of microbial communities of varying complexity has gained increasing research interest. Mass spectrometry-driven metaproteomics enables the analysis of such communities on the functional level, but this fledgling field still faces various technical and semantic challenges regarding experimental data analysis and interpretation. In the present review, we outline the hurdles involved and attempt to cover the most valuable methods and software implementations available to researchers in the field today. Beyond merely focusing on protein identification, we provide an overview on different data pre- and post-processing steps, such as metabolic pathway analysis, that can be useful in a typical metaproteomics workflow. Finally, we briefly discuss directions for future work.
Collapse
Affiliation(s)
- Thilo Muth
- Max Planck Institute for Dynamics of Complex Technical Systems, Bioprocess Engineering, Magdeburg, Germany
| | | | | | | | | |
Collapse
|
39
|
Thomas SN, Liao Z, Clark D, Chen Y, Samadani R, Mao L, Ann DK, Baulch JE, Shapiro P, Yang AJ. Exosomal Proteome Profiling: A Potential Multi-Marker Cellular Phenotyping Tool to Characterize Hypoxia-Induced Radiation Resistance in Breast Cancer. Proteomes 2013; 1:87-108. [PMID: 24860738 PMCID: PMC4029595 DOI: 10.3390/proteomes1020087] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Radiation and drug resistance are significant challenges in the treatment of locally advanced, recurrent and metastatic breast cancer that contribute to mortality. Clinically, radiotherapy requires oxygen to generate cytotoxic free radicals that cause DNA damage and allow that damage to become fixed in the genome rather than repaired. However, approximately 40% of all breast cancers have hypoxic tumor microenvironments that render cancer cells significantly more resistant to irradiation. Hypoxic stimuli trigger changes in the cell death/survival pathway that lead to increased cellular radiation resistance. As a result, the development of noninvasive strategies to assess tumor hypoxia in breast cancer has recently received considerable attention. Exosomes are secreted nanovesicles that have roles in paracrine signaling during breast tumor progression, including tumor-stromal interactions, activation of proliferative pathways and immunosuppression. The recent development of protocols to isolate and purify exosomes, as well as advances in mass spectrometry-based proteomics have facilitated the comprehensive analysis of exosome content and function. Using these tools, studies have demonstrated that the proteome profiles of tumor-derived exosomes are indicative of the oxygenation status of patient tumors. They have also demonstrated that exosome signaling pathways are potentially targetable drivers of hypoxia-dependent intercellular signaling during tumorigenesis. This article provides an overview of how proteomic tools can be effectively used to characterize exosomes and elucidate fundamental signaling pathways and survival mechanisms underlying hypoxia-mediated radiation resistance in breast cancer.
Collapse
Affiliation(s)
- Stefani N Thomas
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA;
| | | | - David Clark
- Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD 21201, USA; (D.C.); (Y.C.); (P.S.) ; Division of Oncology, University of Maryland School of Dentistry, Baltimore, MD 21201, USA
| | - Yangyi Chen
- Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD 21201, USA; (D.C.); (Y.C.); (P.S.)
| | - Ramin Samadani
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, MD 21201, USA;
| | - Li Mao
- Oncology and Diagnostic Sciences, University of Maryland School of Dentistry, Baltimore, MD 21201, USA;
| | - David K Ann
- Department of Molecular Pharmacology, Beckman Research Institute, City of Hope, Duarte, CA 91010, USA; ; Irell and Manella Graduate School of Biological Sciences, Beckman Research Institute, City of Hope, Duarte, CA 91010, USA
| | - Janet E Baulch
- Department of Radiation Oncology, University of California, Irvine, CA 92697, USA;
| | - Paul Shapiro
- Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD 21201, USA; (D.C.); (Y.C.); (P.S.) ; Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, MD 21201, USA;
| | - Austin J Yang
- Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD 21201, USA; (D.C.); (Y.C.); (P.S.) ; Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
40
|
Fu Z, Wang M, Everett A, Lakatta E, Van Eyk J. Can proteomics yield insight into aging aorta? Proteomics Clin Appl 2013; 7:477-89. [PMID: 23788441 DOI: 10.1002/prca.201200138] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2012] [Revised: 06/13/2013] [Accepted: 06/14/2013] [Indexed: 12/16/2022]
Abstract
The aging aorta exhibits structural and physiological changes that are reflected in the proteome of its component cells types. The advance in proteomic technologies has made it possible to analyze the quantity of proteins associated with the natural history of aortic aging. These alterations reflect the molecular and cellular mechanisms of aging and could provide an opportunity to predict vascular health. This paper focuses on whether discoveries stemming from the application of proteomic approaches of the intact aging aorta or vascular smooth muscle cells can provide useful insights. Although there have been limited studies to date, a number of interesting proteins have been identified that are closely associated with aging in the rat aorta. Such proteins, including milk fat globule-EGF factor 8, matrix metalloproteinase type-2, and vitronectin, could be used as indicators of vascular health, or even explored as therapeutic targets for aging-related vascular diseases.
Collapse
Affiliation(s)
- Zongming Fu
- Department of Pediatrics, The Johns Hopkins University, Baltimore, MD 21224, USA
| | | | | | | | | |
Collapse
|
41
|
SCX charge state selective separation of tryptic peptides combined with 2D-RP-HPLC allows for detailed proteome mapping. J Proteomics 2013; 91:164-71. [PMID: 23851314 DOI: 10.1016/j.jprot.2013.06.033] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Revised: 06/25/2013] [Accepted: 06/29/2013] [Indexed: 11/20/2022]
Abstract
UNLABELLED Multidimensional peptide fractionation is widely used in proteomics to reduce the complexity of peptide mixtures prior to mass spectrometric analysis. Here, we describe the sequential use of strong cation exchange and reversed phase liquid chromatography in both basic and acidic pH buffers for separating tryptic peptides from complex mixtures of proteins. Strong cation exchange exclusively separates peptide by their charge state into neutral, singly and multi-charged species. To further reduce complexity, each peptide group was separated by reversed phase liquid chromatography at basic pH and the resultant fractions were analyzed by LC-MS/MS. This workflow was applied to a soluble protein lysate from mouse embryonic fibroblast cells, and more than 5000 proteins from 29,843 peptides were identified. The high selectivity displayed during the SCX step (93% to 100%) and the overlaps between proteins identified from the SCX-separated peptide groups, are interesting assets of the procedure. BIOLOGICAL SIGNIFICANCE The present work shows how complex mixture of peptides can be selectively separated by SCX based essentially on the net charge of peptides. The proposed workflow results in three well-defined subset of peptides of specific amino acid composition, which are representative of the constituent proteins. The very high selectivity obtained (93% to 99%) on the peptide side, underscores for the first time the possibility of SCX chromatography to aid in validating identified peptides.
Collapse
|
42
|
Vaudel M, Sickmann A, Martens L. Introduction to opportunities and pitfalls in functional mass spectrometry based proteomics. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1844:12-20. [PMID: 23845992 DOI: 10.1016/j.bbapap.2013.06.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2012] [Revised: 06/05/2013] [Accepted: 06/25/2013] [Indexed: 10/26/2022]
Abstract
With the advent of mass spectrometry based proteomics, the identification of thousands of proteins has become commonplace in biology nowadays. Increasingly, efforts have also been invested toward the detection and localization of posttranslational modifications. It is furthermore common practice to quantify the identified entities, a task supported by a panel of different methods. Finally, the results can also be enriched with functional knowledge gained on the proteins, detecting for instance differentially expressed gene ontology terms or biological pathways. In this study, we review the resources, methods and tools available for the researcher to achieve such a quantitative functional analysis. These include statistics for the post-processing of identification and quantification results, online resources and public repositories. With a focus on free but user-friendly software, preferably also open-source, we provide a list of tools designed to help the researcher manage the vast amount of data generated. We also indicate where such applications currently remain lacking. Moreover, we stress the eventual pitfalls of every step of such studies. This article is part of a Special Issue entitled: Computational Proteomics in the Post-Identification Era. Guest Editors: Martin Eisenacher and Christian Stephan.
Collapse
Affiliation(s)
- Marc Vaudel
- Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany; Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen, Bergen, Norway.
| | | | | |
Collapse
|