Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Granholm V, Navarro JF, Noble WS, Käll L. Determining the calibration of confidence estimation procedures for unique peptides in shotgun proteomics. J Proteomics 2012;80:123-31. [PMID: 23268117 DOI: 10.1016/j.jprot.2012.12.007] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2012] [Revised: 11/30/2012] [Accepted: 12/11/2012] [Indexed: 01/10/2023]

Cited by Other Article(s)

Frankenfield AM, Yang KL, Mazli WNAB, Shih J, Yu F, Lo E, Nesvizhskii AI, Hao L. Benchmarking SILAC Proteomics Workflows and Data Analysis Platforms. Mol Cell Proteomics 2025;24:100980. [PMID: 40315959 DOI: 10.1016/j.mcpro.2025.100980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2024] [Revised: 04/07/2025] [Accepted: 04/28/2025] [Indexed: 05/04/2025] Open

Abstract

Stable isotope labeling by amino acids in cell culture (SILAC) is a powerful metabolic labeling technique with broad applications and various study designs. SILAC proteomics relies on the accurate identification and quantification of all isotopic versions of proteins and peptides during both data acquisition and analysis. However, a comprehensive comparison and evaluation of SILAC data analysis platforms is currently lacking. To address this critical gap and offer practical guidelines for SILAC proteomics data analysis, we designed a comprehensive benchmarking pipeline to evaluate various in vitro SILAC workflows and commonly used data analysis software. Ten different SILAC data analysis workflows using five software packages (MaxQuant, Proteome Discoverer, FragPipe, DIA-NN, and Spectronaut) were evaluated for static and dynamic SILAC labeling with both DDA and DIA methods. For benchmarking, we used both in-house generated and repository SILAC proteomics datasets from HeLa and neuron culture samples. We assessed 12 performance metrics for SILAC proteomics including identification, quantification, accuracy, precision, reproducibility, filtering criteria, missing values, false discovery rate, protein half-life measurement, data completeness, unique software features, and speed of data analysis. Each method/software has its strengths and weaknesses when evaluated for these performance metrics. Most software reaches a dynamic range limit of 100-fold for accurate quantification of light/heavy ratios. We do not recommend using Proteome Discoverer for SILAC DDA analysis despite its wide use in label-free proteomics. To achieve greater confidence in SILAC quantification, researchers could use more than one software package to analyze the same dataset for cross-validation.
In summary, this study offers the first systematic evaluation of various SILAC data analysis platforms, providing practical guidelines to support decision-making in SILAC proteomics study design and data analysis.

Zelter A, Riffle M, Shteynberg DD, Zhong G, Riddle EB, Hoopmann MR, Jaschob D, Moritz RL, Davis TN, MacCoss MJ, Isoherranen N. Detection and Quantification of Drug-Protein Adducts in Human Liver. J Proteome Res 2024;23:5143-5152. [PMID: 39442081 PMCID: PMC11537226 DOI: 10.1021/acs.jproteome.4c00663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 09/19/2024] [Accepted: 10/10/2024] [Indexed: 10/25/2024]

Madej D, Lam H. On the use of tandem mass spectra acquired from samples of evolutionarily distant organisms to validate methods for false discovery rate estimation. Proteomics 2024;24:e2300398. [PMID: 38491400 DOI: 10.1002/pmic.202300398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 03/01/2024] [Accepted: 03/06/2024] [Indexed: 03/18/2024]

Fröhlich K, Fahrner M, Brombacher E, Seredynska A, Maldacker M, Kreutz C, Schmidt A, Schilling O. Data-Independent Acquisition: A Milestone and Prospect in Clinical Mass Spectrometry-Based Proteomics. Mol Cell Proteomics 2024;23:100800. [PMID: 38880244 PMCID: PMC11380018 DOI: 10.1016/j.mcpro.2024.100800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 06/08/2024] [Accepted: 06/13/2024] [Indexed: 06/18/2024] Open

Abstract

Data-independent acquisition (DIA) has revolutionized the field of mass spectrometry (MS)-based proteomics over the past few years. DIA stands out for its ability to systematically sample all peptides in a given m/z range, allowing an unbiased acquisition of proteomics data. This greatly mitigates the issue of missing values and significantly enhances quantitative accuracy, precision, and reproducibility compared to many traditional methods. This review focuses on the critical role of DIA analysis software tools, primarily focusing on their capabilities and the challenges they address in proteomic research. Advances in MS technology, such as trapped ion mobility spectrometry, or high field asymmetric waveform ion mobility spectrometry require sophisticated analysis software capable of handling the increased data complexity and exploiting the full potential of DIA. We identify and critically evaluate leading software tools in the DIA landscape, discussing their unique features, and the reliability of their quantitative and qualitative outputs. We present the biological and clinical relevance of DIA-MS and discuss crucial publications that paved the way for in-depth proteomic characterization in patient-derived specimens. Furthermore, we provide a perspective on emerging trends in clinical applications and present upcoming challenges including standardization and certification of MS-based acquisition strategies in molecular diagnostics. While we emphasize the need for continuous development of software tools to keep pace with evolving technologies, we advise researchers against uncritically accepting the results from DIA software tools. Each tool may have its own biases, and some may not be as sensitive or reliable as others. Our overarching recommendation for both researchers and clinicians is to employ multiple DIA analysis tools, utilizing orthogonal analysis approaches to enhance the robustness and reliability of their findings.

Affiliation(s)

Klemens Fröhlich Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
Matthias Fahrner Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany
Eva Brombacher Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany; Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
Adrianna Seredynska Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
Maximilian Maldacker Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
Clemens Kreutz Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany
Alexander Schmidt Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
Oliver Schilling Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany.

Chen YE, Ge X, Woyshner K, McDermott M, Manousopoulou A, Ficarro SB, Marto JA, Li K, Wang LD, Li JJ. APIR: Aggregating Universal Proteomics Database Search Algorithms for Peptide Identification with FDR Control. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024;22:qzae042. [PMID: 39198030 DOI: 10.1093/gpbjnl/qzae042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 02/26/2024] [Accepted: 03/11/2024] [Indexed: 09/01/2024]

Affiliation(s)

Yiling Elaine Chen Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
Xinzhou Ge Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
Kyla Woyshner Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA
MeiLu McDermott Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
Antigoni Manousopoulou Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA
Scott B Ficarro Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02215, USA
Jarrod A Marto Department of Cancer Biology and Blais Proteomics Center, Dana-Farber Cancer Institute, Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02215, USA
Kexin Li Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA
Leo David Wang Department of Immuno-Oncology, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA 91010, USA; Department of Pediatrics, City of Hope National Medical Center, Duarte, CA 91010, USA
Jingyi Jessica Li Department of Statistics and Data Science, University of California, Los Angeles, CA 90095, USA; Bioinformatics Interdepartmental Program, University of California, Los Angeles, CA 90095, USA; Department of Human Genetics, University of California, Los Angeles, CA 90095, USA; Department of Computational Medicine, University of California, Los Angeles, CA 90095, USA; Department of Biostatistics, University of California, Los Angeles, CA 90095, USA

Freestone J, Noble WS, Keich U. Reinvestigating the Correctness of Decoy-Based False Discovery Rate Control in Proteomics Tandem Mass Spectrometry. J Proteome Res 2024;23:1907-1914. [PMID: 38687997 DOI: 10.1021/acs.jproteome.3c00902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]

Lin A, See D, Fondrie WE, Keich U, Noble WS. Target-decoy false discovery rate estimation using Crema. Proteomics 2024;24:e2300084. [PMID: 38380501 DOI: 10.1002/pmic.202300084] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 01/06/2024] [Accepted: 01/16/2024] [Indexed: 02/22/2024]

Strauss MT, Bludau I, Zeng WF, Voytik E, Ammar C, Schessner JP, Ilango R, Gill M, Meier F, Willems S, Mann M. AlphaPept: a modern and open framework for MS-based proteomics. Nat Commun 2024;15:2168. [PMID: 38461149 PMCID: PMC10924963 DOI: 10.1038/s41467-024-46485-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 02/20/2024] [Indexed: 03/11/2024] Open

Yu F, Teo GC, Kong AT, Fröhlich K, Li GX, Demichev V, Nesvizhskii AI. Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform. Nat Commun 2023;14:4154. [PMID: 37438352 PMCID: PMC10338508 DOI: 10.1038/s41467-023-39869-5] [Citation(s) in RCA: 92] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 06/28/2023] [Indexed: 07/14/2023] Open

Phlairaharn T, Ye Z, Krismer E, Pedersen AK, Pietzner M, Olsen JV, Schoof EM, Searle BC. Optimizing Linear Ion-Trap Data-Independent Acquisition toward Single-Cell Proteomics. Anal Chem 2023;95:9881-9891. [PMID: 37338819 DOI: 10.1021/acs.analchem.3c00842] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/21/2023]

Zhang Q. Mzion enables deep and precise identification of peptides in data-dependent acquisition proteomics. Sci Rep 2023;13:7056. [PMID: 37120666 PMCID: PMC10148867 DOI: 10.1038/s41598-023-34323-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 04/27/2023] [Indexed: 05/01/2023] Open

Phlairaharn T, Ye Z, Krismer E, Pedersen AK, Pietzner M, Olsen JV, Schoof EM, Searle BC. Optimizing linear ion trap data independent acquisition towards single cell proteomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.21.529444. [PMID: 36865114 PMCID: PMC9980145 DOI: 10.1101/2023.02.21.529444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]

The M, Käll L. Integrating Identification and Quantification Uncertainty for Differential Protein Abundance Analysis with Triqler. Methods Mol Biol 2023;2426:91-117. [PMID: 36308686 DOI: 10.1007/978-1-0716-1967-4_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Hasam S, Emery K, Noble WS, Keich U. A Pipeline for Peptide Detection Using Multiple Decoys. Methods Mol Biol 2023;2426:25-34. [PMID: 36308683 DOI: 10.1007/978-1-0716-1967-4_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Reanalysis of ProteomicsDB Using an Accurate, Sensitive, and Scalable False Discovery Rate Estimation Approach for Protein Groups. Mol Cell Proteomics 2022;21:100437. [PMID: 36328188 PMCID: PMC9718969 DOI: 10.1016/j.mcpro.2022.100437] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 10/16/2022] [Accepted: 10/28/2022] [Indexed: 11/07/2022] Open

Abstract

Estimating false discovery rates (FDRs) of protein identification continues to be an important topic in mass spectrometry-based proteomics, particularly when analyzing very large datasets. One performant method for this purpose is the Picked Protein FDR approach which is based on a target-decoy competition strategy on the protein level that ensures that FDRs scale to large datasets. Here, we present an extension to this method that can also deal with protein groups, that is, proteins that share common peptides such as protein isoforms of the same gene. To obtain well-calibrated FDR estimates that preserve protein identification sensitivity, we introduce two novel ideas. First, the picked group target-decoy and second, the rescued subset grouping strategies. Using entrapment searches and simulated data for validation, we demonstrate that the new Picked Protein Group FDR method produces accurate protein group-level FDR estimates regardless of the size of the data set. The validation analysis also uncovered that applying the commonly used Occam's razor principle leads to anticonservative FDR estimates for large datasets. This is not the case for the Picked Protein Group FDR method. Reanalysis of deep proteomes of 29 human tissues showed that the new method identified up to 4% more protein groups than MaxQuant. Applying the method to the reanalysis of the entire human section of ProteomicsDB led to the identification of 18,000 protein groups at 1% protein group-level FDR. The analysis also showed that about 1250 genes were represented by ≥2 identified protein groups. To make the method accessible to the proteomics community, we provide a software tool including a graphical user interface that enables merging results from multiple MaxQuant searches into a single list of identified and quantified protein groups.
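
The protein-level target-decoy competition mentioned in this abstract can be illustrated with a minimal sketch. It shows only the basic "picked" idea (each protein's target and decoy entries compete, and the higher-scoring one survives), not the protein-group extension the abstract describes. The function names, the dictionary inputs, and the tie-breaking rule are assumptions made for illustration and are not taken from the cited software.

```python
from typing import Dict, List, Tuple


def picked_competition(
    target_scores: Dict[str, float],
    decoy_scores: Dict[str, float],
) -> List[Tuple[str, float, bool]]:
    """Keep, per protein, whichever of its target or decoy entry scores higher.

    Returns (protein, score, is_decoy) tuples sorted best score first.
    Higher scores are assumed to be better.
    """
    survivors = []
    for protein in set(target_scores) | set(decoy_scores):
        t = target_scores.get(protein, float("-inf"))
        d = decoy_scores.get(protein, float("-inf"))
        # Ties go to the decoy: an arbitrary, conservative choice for this sketch.
        if t > d:
            survivors.append((protein, t, False))
        else:
            survivors.append((protein, d, True))
    survivors.sort(key=lambda s: s[1], reverse=True)
    return survivors


def fdr_by_rank(survivors: List[Tuple[str, float, bool]]) -> List[float]:
    """Estimate the FDR at each rank as (decoys so far) / (targets so far)."""
    targets = decoys = 0
    estimates = []
    for _, _, is_decoy in survivors:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        estimates.append(min(decoys / max(targets, 1), 1.0))
    return estimates


if __name__ == "__main__":
    targets = {"A": 9.0, "B": 3.0, "C": 7.0}
    decoys = {"A": 2.0, "B": 5.0, "C": 1.0}
    ranked = picked_competition(targets, decoys)
    print(ranked)
    print(fdr_by_rank(ranked))
```

Because each protein contributes at most one entry to the ranked list, the decoy count does not grow with the number of spectra searched, which is the scaling property the abstract attributes to the picked strategy.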

Lee S, Park H, Kim H. False discovery rate estimation using candidate peptides for each spectrum. BMC Bioinformatics 2022;23:454. [PMID: 36319948 PMCID: PMC9623924 DOI: 10.1186/s12859-022-05002-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 10/25/2022] [Indexed: 11/06/2022] Open

Abstract

BACKGROUND

False discovery rate (FDR) estimation is very important in proteomics. The target-decoy strategy (TDS), which is often used for FDR estimation, estimates the FDR under the assumption that when spectra are identified incorrectly, the probabilities of the spectra matching the target or decoy peptides are identical. However, for most spectra these two probabilities are not identical. We propose cTDS (target-decoy strategy with candidate peptides) for accurate estimation of the FDR using the probability that the spectrum is identified incorrectly as a target or decoy peptide.

RESULTS

For most spectra, the probability of being identified incorrectly as a target or decoy peptide is close to 0.5, but only about 1.14-4.85% of all spectra have a probability of exactly 0.5. We used an entrapment sequence method to demonstrate the accuracy of cTDS. For fixed FDR thresholds (1-10%), the false match rate (FMR) in cTDS is closer to the threshold than the FMR in TDS. We compared the number of peptide-spectrum matches (PSMs) obtained with TDS and cTDS at a 1% FDR threshold with the HEK293 dataset. In the first and third replicates, the number of PSMs obtained with cTDS for the reverse, pseudo-reverse, shuffle, and de Bruijn databases exceeded those obtained with TDS (by about 0.001-0.132%), whereas the pseudo-shuffle database yielded fewer PSMs than TDS (by about 0.05-0.126%). In the second replicate, the number of PSMs obtained with cTDS for all databases exceeded that obtained with TDS (by about 0.013-0.274%).

CONCLUSIONS

When spectra are actually identified incorrectly, most probabilities of the spectra matching a target or decoy peptide are not identical. Therefore, we propose cTDS, which estimates the FDR more accurately using the probability of the spectrum being identified incorrectly as a target or decoy peptide.
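
The equal-probability assumption behind the standard TDS can be made concrete with a short sketch. This is the common decoys-over-targets estimate, not cTDS. The function names and the (score, is_decoy) input layout are hypothetical and are not taken from the cited paper.

```python
from typing import List, Tuple

PSM = Tuple[float, bool]  # (match score, is_decoy); higher score = better match


def tds_fdr_curve(psms: List[PSM]) -> List[Tuple[float, float]]:
    """Return (score, estimated FDR) pairs, walking from the best score down."""
    ranked = sorted(psms, key=lambda p: p[0], reverse=True)
    targets = decoys = 0
    curve = []
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        # If an incorrect match is equally likely to hit a target or a decoy,
        # the decoy count estimates the number of incorrect target matches.
        curve.append((score, min(decoys / max(targets, 1), 1.0)))
    return curve


def count_accepted_targets(psms: List[PSM], fdr_threshold: float) -> int:
    """Count target PSMs in the longest ranked prefix whose FDR <= threshold."""
    ranked = sorted(psms, key=lambda p: p[0], reverse=True)
    cutoff = 0
    for i, (_, fdr) in enumerate(tds_fdr_curve(psms)):
        if fdr <= fdr_threshold:
            cutoff = i + 1
    return sum(1 for _, is_decoy in ranked[:cutoff] if not is_decoy)


if __name__ == "__main__":
    example = [(10.0, False), (9.0, False), (8.0, True), (7.0, False), (6.0, False)]
    print(tds_fdr_curve(example))
    print(count_accepted_targets(example, 0.25))
```

If the probability that an incorrect match lands on a decoy differs from 0.5, the decoy count in this sketch no longer estimates the number of incorrect target matches without bias, which is the concern the abstract raises.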

Lin A, Short T, Noble WS, Keich U. Improving Peptide-Level Mass Spectrometry Analysis via Double Competition. J Proteome Res 2022;21:2412-2420. [PMID: 36166314 PMCID: PMC10108709 DOI: 10.1021/acs.jproteome.2c00282] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Freestone J, Short T, Noble WS, Keich U. Group-walk: a rigorous approach to group-wise false discovery rate analysis by target-decoy competition. Bioinformatics 2022;38:ii82-ii88. [PMID: 36124786 DOI: 10.1093/bioinformatics/btac471] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Abstract

MOTIVATION

Target-decoy competition (TDC) is a commonly used method for false discovery rate (FDR) control in the analysis of tandem mass spectrometry data. This type of competition-based FDR control has recently gained significant popularity in other fields after Barber and Candès laid its theoretical foundation in a more general setting that included the feature selection problem. In both cases, the competition is based on a head-to-head comparison between an (observed) target score and a corresponding decoy (knockoff) score. However, the effectiveness of TDC depends on whether the data are homogeneous, which is often not the case: in many settings, the data consist of groups with different score profiles or different proportions of true nulls. In such cases, applying TDC while ignoring the group structure often yields imbalanced lists of discoveries, where some groups might include relatively many false discoveries and other groups include relatively very few. On the other hand, as we show, the alternative approach of applying TDC separately to each group does not rigorously control the FDR.

RESULTS

We developed Group-walk, a procedure that controls the FDR in the target-decoy/knockoff setting while taking into account a given group structure. Group-walk is derived from the recently developed AdaPT, a general framework for controlling the FDR with side information. We show using simulated and real datasets that when the data naturally divide into groups with different characteristics, Group-walk can deliver consistent power gains that in some cases are substantial. These groupings include the precursor charge state (4% more discovered peptides at 1% FDR threshold), the peptide length (3.6% increase) and the mass difference due to modifications (26% increase).

AVAILABILITY AND IMPLEMENTATION

Group-walk is available at https://cran.r-project.org/web/packages/groupwalk/index.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Physiological and molecular responses of lobe coral indicate nearshore adaptations to anthropogenic stressors. Sci Rep 2021;11:3423. [PMID: 33564085 PMCID: PMC7873073 DOI: 10.1038/s41598-021-82569-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 01/18/2021] [Indexed: 01/08/2023] Open

Sherafat E, Force J, Măndoiu II. Semi-supervised learning for somatic variant calling and peptide identification in personalized cancer immunotherapy. BMC Bioinformatics 2020;21:498. [PMID: 33375939 PMCID: PMC7772914 DOI: 10.1186/s12859-020-03813-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 10/13/2020] [Indexed: 02/03/2023] Open

Couté Y, Bruley C, Burger T. Beyond Target-Decoy Competition: Stable Validation of Peptide and Protein Identifications in Mass Spectrometry-Based Discovery Proteomics. Anal Chem 2020;92:14898-14906. [PMID: 32970414 DOI: 10.1021/acs.analchem.0c00328] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Prieto G, Vázquez J. Protein Probability Model for High-Throughput Protein Identification by Mass Spectrometry-Based Proteomics. J Proteome Res 2020;19:1285-1297. [PMID: 32037837 DOI: 10.1021/acs.jproteome.9b00819] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Prieto G, Vázquez J. Calculation of False Discovery Rate for Peptide and Protein Identification. Methods Mol Biol 2020;2051:145-159. [PMID: 31552628 DOI: 10.1007/978-1-4939-9744-2_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Mikan MP, Harvey HR, Timmins-Schiffman E, Riffle M, May DH, Salter I, Noble WS, Nunn BL. Metaproteomics reveal that rapid perturbations in organic matter prioritize functional restructuring over taxonomy in western Arctic Ocean microbiomes. THE ISME JOURNAL 2020;14:39-52. [PMID: 31492961 PMCID: PMC6908719 DOI: 10.1038/s41396-019-0503-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 07/31/2019] [Accepted: 08/06/2019] [Indexed: 02/05/2023]

Chen ZL, Meng JM, Cao Y, Yin JL, Fang RQ, Fan SB, Liu C, Zeng WF, Ding YH, Tan D, Wu L, Zhou WJ, Chi H, Sun RX, Dong MQ, He SM. A high-speed search engine pLink 2 with systematic evaluation for proteome-scale identification of cross-linked peptides. Nat Commun 2019;10:3404. [PMID: 31363125 PMCID: PMC6667459 DOI: 10.1038/s41467-019-11337-z] [Citation(s) in RCA: 294] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2018] [Accepted: 06/20/2019] [Indexed: 01/05/2023] Open

Affiliation(s)

Zhen-Lin Chen Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Jia-Ming Meng Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Yong Cao National Institute of Biological Sciences, Beijing, 102206, China
Ji-Li Yin Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Run-Qian Fang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Sheng-Bo Fan Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Chao Liu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Wen-Feng Zeng Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Yue-He Ding National Institute of Biological Sciences, Beijing, 102206, China
Dan Tan National Institute of Biological Sciences, Beijing, 102206, China
Long Wu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Wen-Jing Zhou Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Hao Chi Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China
Rui-Xiang Sun National Institute of Biological Sciences, Beijing, 102206, China
Meng-Qiu Dong National Institute of Biological Sciences, Beijing, 102206, China
Si-Min He Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, 100049, China

The M, Käll L. Integrated Identification and Quantification Error Probabilities for Shotgun Proteomics. Mol Cell Proteomics 2019;18:561-570. [PMID: 30482846 PMCID: PMC6398204 DOI: 10.1074/mcp.ra118.001018] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 11/05/2018] [Indexed: 02/02/2023] Open

Hu A, Lu YY, Bilmes J, Noble WS. Joint Precursor Elution Profile Inference via Regression for Peptide Detection in Data-Independent Acquisition Mass Spectra. J Proteome Res 2019;18:86-94. [PMID: 30362768 PMCID: PMC6465123 DOI: 10.1021/acs.jproteome.8b00365] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Chi H, Liu C, Yang H, Zeng WF, Wu L, Zhou WJ, Wang RM, Niu XN, Ding YH, Zhang Y, Wang ZW, Chen ZL, Sun RX, Liu T, Tan GM, Dong MQ, Xu P, Zhang PH, He SM. Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine. Nat Biotechnol 2018;36:nbt.4236. [PMID: 30295672 DOI: 10.1038/nbt.4236] [Citation(s) in RCA: 253] [Impact Index Per Article: 36.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2017] [Accepted: 08/03/2018] [Indexed: 12/27/2022]

Affiliation(s)

Hao Chi Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Chao Liu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Hao Yang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Wen-Feng Zeng Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Long Wu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Wen-Jing Zhou Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Rui-Min Wang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Xiu-Nan Niu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Yue-He Ding National Institute of Biological Sciences, Beijing, Beijing, China
Yao Zhang State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China; State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, College of Ecology and Evolution, Sun Yat-Sen University, Guangzhou, China
Zhao-Wei Wang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Zhen-Lin Chen Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Rui-Xiang Sun Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Tao Liu Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China
Guang-Ming Tan Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China
Meng-Qiu Dong National Institute of Biological Sciences, Beijing, Beijing, China
Ping Xu State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
Pei-Heng Zhang Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China
Si-Min He Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China

The M, Edfors F, Perez-Riverol Y, Payne SH, Hoopmann MR, Palmblad M, Forsström B, Käll L. A Protein Standard That Emulates Homology for the Characterization of Protein Inference Algorithms. J Proteome Res 2018;17:1879-1886. [PMID: 29631402 DOI: 10.1021/acs.jproteome.7b00899] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Ting YS, Egertson JD, Bollinger JG, Searle BC, Payne SH, Noble WS, MacCoss MJ. PECAN: library-free peptide detection for data-independent acquisition tandem mass spectrometry data. Nat Methods 2017;14:903-908. [PMID: 28783153 PMCID: PMC5578911 DOI: 10.1038/nmeth.4390] [Citation(s) in RCA: 137] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Accepted: 06/20/2017] [Indexed: 12/18/2022]

Levitsky LI, Ivanov MV, Lobas AA, Gorshkov MV. Unbiased False Discovery Rate Estimation for Shotgun Proteomics Based on the Target-Decoy Approach. J Proteome Res 2016;16:393-397. [DOI: 10.1021/acs.jproteome.6b00144] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

The M, MacCoss MJ, Noble WS, Käll L. Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0. J Am Soc Mass Spectrom 2016;27:1719-1727. [PMID: 27572102 PMCID: PMC5059416 DOI: 10.1007/s13361-016-1460-7] [Citation(s) in RCA: 286] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Revised: 06/15/2016] [Accepted: 07/20/2016] [Indexed: 05/21/2023]

Nardiello D, Natale A, Palermo C, Quinto M, Centonze D. Combined use of peptide ion and normalized delta scores to evaluate milk authenticity by ion-trap based proteomics coupled with error tolerant searching. Talanta 2016;164:684-692. [PMID: 28107990 DOI: 10.1016/j.talanta.2016.10.102] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Revised: 10/25/2016] [Accepted: 10/30/2016] [Indexed: 12/17/2022]

Abstract

A fundamental issue in proteomics is peptide identification by database searching and the assessment of the goodness of fit between experimental and theoretical data. Despite the many ways to measure the quality of search results, the definition of a scoring criterion is still highly desirable in ion-trap based proteomics. Indeed, in order to take full advantage of a low-resolution MS/MS dataset, it is essential to strike a balance between greater information capture and a reduced number of incorrect peptide assignments. In addition, the development of user-specified rules is crucial when very similar proteins of the same family are analyzed in order to infer the species of origin. In this study, a post-processing validation scheme is provided for the evaluation of proteomic data in shotgun ion-trap proteomics, when flexible database searching based on the error-tolerant mode is adopted in combination with a low-specificity enzyme to maximize sequence coverage. To validate peptide assignments, we used standard β-casein digested with trypsin/chymotrypsin or trypsin alone and the popular search engine MASCOT to identify the relevant (known) peptide sequences. A linear combination of the peptide ion score and the normalized delta score (i.e., the difference between the best and the second-best ion score, divided by the best score) is proposed to increase the accuracy of sequence assignments from low-resolution tandem mass spectra. Finally, the optimized post-processing database validation was successfully applied to the direct analysis of milk tryptic/chymotryptic digests of different origin, without resorting to the two-dimensional electrophoresis that is usually performed for protein separation in ion-trap proteomics. The identification of species-specific amino acid sequences among the validated peptide-spectrum matches made it possible to fully discriminate between the animal species and thus to evaluate milk authenticity accurately.
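
The normalized delta score defined in the abstract above can be illustrated with a short sketch. This is not the authors' implementation: the function names and the weights w_ion and w_delta are hypothetical (the abstract does not give the coefficients of the linear combination), and treating a missing second-best candidate as a score of 0 is an assumption.

```python
def normalized_delta_score(ion_scores):
    """Return (best - second best) / best for one spectrum's candidate ion scores."""
    ranked = sorted(ion_scores, reverse=True)
    best = ranked[0]
    # Assumption: with a single candidate, the second-best score is taken as 0.
    second = ranked[1] if len(ranked) > 1 else 0.0
    if best <= 0:
        return 0.0  # avoid division by zero for non-positive best scores
    return (best - second) / best


def combined_score(ion_scores, w_ion=1.0, w_delta=1.0):
    """Linear combination of the best ion score and the normalized delta score.

    w_ion and w_delta are placeholder weights, not values from the paper.
    """
    best = max(ion_scores)
    return w_ion * best + w_delta * normalized_delta_score(ion_scores)


# Example: three candidate peptides for one spectrum.
scores = [50.0, 40.0, 10.0]
print(normalized_delta_score(scores))          # (50 - 40) / 50
print(combined_score(scores, w_delta=10.0))    # 50 + 10 * 0.2
```

A large normalized delta score means the top candidate is well separated from the runner-up, which is why it can complement the raw ion score when spectra are of low resolution.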

The M, Tasnim A, Käll L. How to talk about protein-level false discovery rates in shotgun proteomics. Proteomics 2016;16:2461-9. [PMID: 27503675 PMCID: PMC5096025 DOI: 10.1002/pmic.201500431] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Revised: 05/12/2016] [Accepted: 07/20/2016] [Indexed: 12/04/2022]

May DH, Timmins-Schiffman E, Mikan MP, Harvey HR, Borenstein E, Nunn BL, Noble WS. An Alignment-Free "Metapeptide" Strategy for Metaproteomic Characterization of Microbiome Samples Using Shotgun Metagenomic Sequencing. J Proteome Res 2016;15:2697-705. [PMID: 27396978 PMCID: PMC5116374 DOI: 10.1021/acs.jproteome.6b00239] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]

Savitski MM, Wilhelm M, Hahne H, Kuster B, Bantscheff M. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets. Mol Cell Proteomics 2015;14:2394-404. [PMID: 25987413 DOI: 10.1074/mcp.m114.046995] [Citation(s) in RCA: 325] [Impact Index Per Article: 32.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Indexed: 02/06/2023] Open

Abstract

Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software.
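
The pairing step of the "picked" approach described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the published implementation: the function name and input layout are hypothetical, scores are assumed to be "higher is better" (for example, a negative log of the best peptide q-value), each decoy is assumed to be keyed by the identifier of its target protein, ties are assumed to go to the decoy, and the FDR is estimated as the simple ratio of picked decoys to picked targets.

```python
def picked_protein_fdr(target_scores, decoy_scores):
    """Estimate protein-level FDR after picking one winner per target/decoy pair.

    target_scores, decoy_scores: dicts mapping protein id -> score (higher is better).
    Returns a list of (protein, is_target, score, fdr) sorted by decreasing score.
    """
    picked = []
    for protein in set(target_scores) | set(decoy_scores):
        t = target_scores.get(protein, float("-inf"))
        d = decoy_scores.get(protein, float("-inf"))
        # Keep only the higher-scoring member of the pair; ties go to the decoy.
        is_target = t > d
        picked.append((protein, is_target, max(t, d)))

    # Best score first; on equal scores decoys sort first (conservative), then by id.
    picked.sort(key=lambda row: (-row[2], row[1], row[0]))

    results = []
    n_targets = n_decoys = 0
    for protein, is_target, score in picked:
        if is_target:
            n_targets += 1
        else:
            n_decoys += 1
        fdr = n_decoys / max(n_targets, 1)  # decoys / targets at this threshold
        results.append((protein, is_target, score, fdr))
    return results


# Example with made-up scores: P3's decoy outscores its target.
targets = {"P1": 10.0, "P2": 8.0, "P3": 3.0, "P4": 6.0}
decoys = {"P1": 2.0, "P2": 1.0, "P3": 5.0, "P4": 4.0}
for row in picked_protein_fdr(targets, decoys):
    print(row)
```

Because each pair contributes at most one entry to the ranked list, low-scoring decoys paired with confidently identified targets are never counted, which is the behavior the abstract credits with avoiding decoy over-representation in very large data sets.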

Howbert JJ, Noble WS. Computing exact p-values for a cross-correlation shotgun proteomics score function. Mol Cell Proteomics 2014;13:2467-79. [PMID: 24895379 DOI: 10.1074/mcp.o113.036327] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open

Granholm V, Kim S, Navarro JCF, Sjölund E, Smith RD, Käll L. Fast and accurate database searches with MS-GF+Percolator. J Proteome Res 2013;13:890-7. [PMID: 24344789 DOI: 10.1021/pr400937n] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Serang O, Cansizoglu AE, Käll L, Steen H, Steen JA. Nonparametric Bayesian evaluation of differential protein quantification. J Proteome Res 2013;12:4556-65. [PMID: 24024742 DOI: 10.1021/pr400678m] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]