Reference Citation Analysis

For: Webb-Robertson BJM, Cannon WR, Oehmen CS, Shah AR, Gurumoorthi V, Lipton MS, Waters KM. A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics. Bioinformatics 2008;24:1503-9. [PMID: 18453551 DOI: 10.1093/bioinformatics/btn218] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Indexed: 11/14/2022]


Cited by Other Article(s)

Kalhor M, Lapin J, Picciani M, Wilhelm M. Rescoring Peptide Spectrum Matches: Boosting Proteomics Performance by Integrating Peptide Property Predictors Into Peptide Identification. Mol Cell Proteomics 2024;23:100798. [PMID: 38871251 PMCID: PMC11269915 DOI: 10.1016/j.mcpro.2024.100798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/02/2024] [Revised: 05/26/2024] [Accepted: 06/09/2024] [Indexed: 06/15/2024] Open

Gong Y, Ding W, Wang P, Wu Q, Yao X, Yang Q. Evaluating Machine Learning Methods of Analyzing Multiclass Metabolomics. J Chem Inf Model 2023;63:7628-7641. [PMID: 38079572 DOI: 10.1021/acs.jcim.3c01525] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Indexed: 12/26/2023]

Abstract

Multiclass metabolomic studies have become popular for revealing the differences in multiple stages of complex diseases, various lifestyles, or the effects of specific treatments. In multiclass metabolomics, there are multiple data manipulation steps for analyzing raw data, which consist of data filtering, the imputation of missing values, data normalization, marker identification, sample separation, classification, and so on. In each step, several to dozens of machine learning methods can be chosen for the given data set, with potentially hundreds or thousands of method combinations in the whole data processing chain. Therefore, a clear understanding of these machine learning methods is helpful for selecting an appropriate method combination for obtaining stable and reliable analytical results of specific data. However, there has rarely been an overall introduction or evaluation of these methods based on multiclass metabolomic data. Herein, detailed descriptions of these machine learning methods in multiple data manipulation steps are reviewed. Moreover, an assessment of these methods was performed using a benchmark data set for multiclass metabolomics. First, 12 imputation methods for imputing missing values were evaluated based on the PSS (Procrustes statistical shape analysis) and NRMSE (normalized root-mean-square error) values. Second, 17 normalization methods for processing multiclass metabolomic data were evaluated by applying the PMAD (pooled median absolute deviation) value. Third, different methods of identifying markers of multiclass metabolomics were evaluated based on the CWrel (relative weighted consistency) value. Fourth, nine classification methods for constructing multiclass models were assessed using the AUC (area under the curve) value. Performance evaluations of machine learning methods are highly recommended to select the most appropriate method combination before performing the final analysis of the given data. 
Overall, detailed descriptions and evaluation of various machine learning methods are expected to improve analyses of multiclass metabolomic data.

Yang Q, Gong Y, Zhu F. Critical Assessment of the Biomarker Discovery and Classification Methods for Multiclass Metabolomics. Anal Chem 2023;95:5542-5552. [PMID: 36944135 DOI: 10.1021/acs.analchem.2c04402] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Indexed: 03/23/2023]

Mohammed Y, Goodlett D, Borchers CH. Bioinformatics Tools and Knowledgebases to Assist Generating Targeted Assays for Plasma Proteomics. Methods Mol Biol 2023;2628:557-577. [PMID: 36781806 DOI: 10.1007/978-1-0716-2978-9_32] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 02/15/2023]

Rusilowicz M, Newman DW, Creamer DR, Johnson J, Adair K, Harman VM, Grant CM, Beynon RJ, Hubbard SJ. AlacatDesigner: Computational Design of Peptide Concatamers for Protein Quantitation. J Proteome Res 2023;22:594-604. [PMID: 36688735 PMCID: PMC9903321 DOI: 10.1021/acs.jproteome.2c00608] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 01/24/2023]

Affiliation(s)

Martin Rusilowicz Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
David W. Newman Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
Declan R. Creamer Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
James Johnson GeneMill, Institute of Systems Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
Kareena Adair Centre for Proteome Research, Institute of Systems and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
Victoria M. Harman Centre for Proteome Research, Institute of Systems and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
Chris M. Grant Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
Robert J. Beynon Centre for Proteome Research, Institute of Systems and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
Simon J. Hubbard Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom

Morales-Martínez A, Bertrand B, Hernández-Meza JM, Garduño-Juárez R, Silva-Sanchez J, Munoz-Garay C. Membrane fluidity, composition, and charge affect the activity and selectivity of the AMP ascaphin-8. Biophys J 2022;121:3034-3048. [PMID: 35842753 PMCID: PMC9463648 DOI: 10.1016/j.bpj.2022.07.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2022] [Revised: 06/28/2022] [Accepted: 07/12/2022] [Indexed: 12/29/2022] Open

Dincer AB, Lu Y, Schweppe DK, Oh S, Noble WS. Reducing Peptide Sequence Bias in Quantitative Mass Spectrometry Data with Machine Learning. J Proteome Res 2022;21:1771-1782. [PMID: 35696663 DOI: 10.1021/acs.jproteome.2c00211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022]

Gao Z, Chang C, Yang J, Zhu Y, Fu Y. AP3: An Advanced Proteotypic Peptide Predictor for Targeted Proteomics by Incorporating Peptide Digestibility. Anal Chem 2019;91:8705-8711. [DOI: 10.1021/acs.analchem.9b02520] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Indexed: 01/03/2023]

Zimmer D, Schneider K, Sommer F, Schroda M, Mühlhaus T. Artificial Intelligence Understands Peptide Observability and Assists With Absolute Protein Quantification. Frontiers in Plant Science 2018;9:1559. [PMID: 30483279 PMCID: PMC6242780 DOI: 10.3389/fpls.2018.01559] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Received: 05/30/2018] [Accepted: 10/04/2018] [Indexed: 05/20/2023]

Abstract

Targeted mass spectrometry has become the method of choice to gain absolute quantification information of high quality, which is essential for a quantitative understanding of biological systems. However, the design of absolute protein quantification assays remains challenging due to variations in peptide observability and incomplete knowledge about factors influencing peptide detectability. Here, we present a deep learning algorithm for peptide detectability prediction, d::pPop, which allows the informed selection of synthetic proteotypic peptides for the successful design of targeted proteomics quantification assays. The deep neural network is able to learn a regression model that relates the physicochemical properties of a peptide to its ion intensity detected by mass spectrometry. The approach makes use of experimentally detected deviations from the assumed equimolar abundance of all peptides derived from a given protein. Trained on extensive proteomics datasets, d::pPop's plant and non-plant specific models can predict the quality of proteotypic peptides for not yet experimentally identified proteins. Interrogating the deep neural network after it has learned from ~76,000 peptides per model organism makes it possible to investigate the impact of different physicochemical properties on the observability of a peptide, thus providing insights into peptide observability as a multifaceted process. Empirical evaluation with rank accuracy metrics showed that our prediction approach outperforms existing algorithms. We circumvent the delicate step of selecting positive and negative training sets and at the same time more closely reflect the need for selecting the most promising peptides for targeting a protein of interest. Further, we used an artificial QconCAT protein to experimentally validate the observability prediction.
Our proteotypic peptide prediction approach not only facilitates the design of absolute protein quantification assays via a user-friendly web interface but also enables the selection of proteotypic peptides for not yet observed proteins, hence rendering the tool especially useful for plant research.

Manes NP, Nita-Lazar A. Application of targeted mass spectrometry in bottom-up proteomics for systems biology research. J Proteomics 2018;189:75-90. [PMID: 29452276 DOI: 10.1016/j.jprot.2018.02.008] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Received: 11/13/2017] [Revised: 01/25/2018] [Accepted: 02/07/2018] [Indexed: 02/08/2023]

Abstract

The enormous diversity of proteoforms produces tremendous complexity within cellular proteomes, facilitates intricate networks of molecular interactions, and constitutes a formidable analytical challenge for biomedical researchers. Currently, quantitative whole-proteome profiling often relies on non-targeted liquid chromatography-mass spectrometry (LC-MS), which samples proteoforms broadly, but can suffer from lower accuracy, sensitivity, and reproducibility compared with targeted LC-MS. Recent advances in bottom-up proteomics using targeted LC-MS have enabled previously unachievable identification and quantification of target proteins and posttranslational modifications within complex samples. Consequently, targeted LC-MS is rapidly advancing biomedical research, especially systems biology research in diverse areas that include proteogenomics, interactomics, kinomics, and biological pathway modeling. With the recent development of targeted LC-MS assays for nearly the entire human proteome, targeted LC-MS is positioned to enable quantitative proteomic profiling of unprecedented quality and accessibility to support fundamental and clinical research. Here we review recent applications of bottom-up proteomics using targeted LC-MS for systems biology research. SIGNIFICANCE: Advances in targeted proteomics are rapidly advancing systems biology research. Recent applications include systems-level investigations focused on posttranslational modifications (such as phosphoproteomics), protein conformation, protein-protein interaction, kinomics, proteogenomics, and metabolic and signaling pathways. Notably, absolute quantification of metabolic and signaling pathway proteins has enabled accurate pathway modeling and engineering. 
Integration of targeted proteomics with other technologies, such as RNA-seq, has facilitated diverse research such as the identification of hundreds of "missing" human proteins (proteins that genes and transcripts appear to encode but for which direct experimental evidence was lacking).

Hoofnagle AN, Whiteaker JR, Carr SA, Kuhn E, Liu T, Massoni SA, Thomas SN, Townsend RR, Zimmerman LJ, Boja E, Chen J, Crimmins DL, Davies SR, Gao Y, Hiltke TR, Ketchum KA, Kinsinger CR, Mesri M, Meyer MR, Qian WJ, Schoenherr RM, Scott MG, Shi T, Whiteley GR, Wrobel JA, Wu C, Ackermann BL, Aebersold R, Barnidge DR, Bunk DM, Clarke N, Fishman JB, Grant RP, Kusebauch U, Kushnir MM, Lowenthal MS, Moritz RL, Neubert H, Patterson SD, Rockwood AL, Rogers J, Singh RJ, Van Eyk JE, Wong SH, Zhang S, Chan DW, Chen X, Ellis MJ, Liebler DC, Rodland KD, Rodriguez H, Smith RD, Zhang Z, Zhang H, Paulovich AG. Recommendations for the Generation, Quantification, Storage, and Handling of Peptides Used for Mass Spectrometry-Based Assays. Clin Chem 2016;62:48-69. [PMID: 26719571 DOI: 10.1373/clinchem.2015.250563] [Citation(s) in RCA: 162] [Impact Index Per Article: 18.0] [Indexed: 11/06/2022]

Affiliation(s)

Andrew N Hoofnagle University of Washington, Seattle, WA
Jeffrey R Whiteaker Fred Hutchinson Cancer Research Center, Seattle, WA
Steven A Carr Broad Institute, Cambridge, MA
Eric Kuhn Broad Institute, Cambridge, MA
Tao Liu Pacific Northwest National Laboratory, Richland, WA
Sam A Massoni New England Peptide, Inc., Gardner, MA
Stefani N Thomas Johns Hopkins University, Baltimore, MD
R Reid Townsend Washington University, St Louis, MO
Lisa J Zimmerman Vanderbilt University, Nashville, TN
Emily Boja National Cancer Institute, Bethesda, MD
Jing Chen Johns Hopkins University, Baltimore, MD
Daniel L Crimmins Washington University, St Louis, MO
Sherri R Davies Washington University, St Louis, MO
Yuqian Gao Pacific Northwest National Laboratory, Richland, WA
Tara R Hiltke National Cancer Institute, Bethesda, MD
Karen A Ketchum ESAC, Inc., Rockville, MD
Christopher R Kinsinger National Cancer Institute, Bethesda, MD
Mehdi Mesri National Cancer Institute, Bethesda, MD
Matthew R Meyer Washington University, St Louis, MO
Wei-Jun Qian Pacific Northwest National Laboratory, Richland, WA
Regine M Schoenherr Fred Hutchinson Cancer Research Center, Seattle, WA
Mitchell G Scott Washington University, St Louis, MO
Tujin Shi Pacific Northwest National Laboratory, Richland, WA
Gordon R Whiteley Frederick National Laboratory for Cancer Research, Frederick, MD
John A Wrobel University of North Carolina School of Medicine, Chapel Hill, NC
Chaochao Wu Pacific Northwest National Laboratory, Richland, WA
Brad L Ackermann Eli Lilly and Company, Indianapolis, IN
Ruedi Aebersold Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
David R Barnidge Mayo Clinic College of Medicine, Rochester, MN
David M Bunk NIST, Gaithersburg, MD
Nigel Clarke Quest Diagnostics, San Juan Capistrano, CA
Jordan B Fishman 21st Century Biochemicals, Inc., Marlborough, MA
Russ P Grant Laboratory Corporation of America Holdings, Inc., Burlington, NC
Ulrike Kusebauch Institute for Systems Biology, Seattle, WA
Mark M Kushnir University of Utah and ARUP Laboratories, Salt Lake City, UT
Mark S Lowenthal NIST, Gaithersburg, MD
Robert L Moritz Institute for Systems Biology, Seattle, WA
Hendrik Neubert Pfizer, Inc., Andover, MA
Scott D Patterson Gilead Sciences, Inc., Foster City, CA
Alan L Rockwood University of Utah and ARUP Laboratories, Salt Lake City, UT
John Rogers Thermo Fisher Scientific, Rockford, IL
Ravinder J Singh Mayo Clinic College of Medicine, Rochester, MN
Jennifer E Van Eyk Cedars Sinai Medical Center, Los Angeles, CA
Steven H Wong Wake Forest School of Medicine, Winston-Salem, NC
Shucha Zhang Enanta Pharmaceuticals, Watertown, MA
Daniel W Chan Johns Hopkins University, Baltimore, MD
Xian Chen University of North Carolina School of Medicine, Chapel Hill, NC
Matthew J Ellis Baylor College of Medicine, Houston, TX
Daniel C Liebler Vanderbilt University, Nashville, TN
Karin D Rodland Pacific Northwest National Laboratory, Richland, WA
Henry Rodriguez National Cancer Institute, Bethesda, MD
Richard D Smith Pacific Northwest National Laboratory, Richland, WA
Zhen Zhang Johns Hopkins University, Baltimore, MD
Hui Zhang Johns Hopkins University, Baltimore, MD
Amanda G Paulovich Fred Hutchinson Cancer Research Center, Seattle, WA

Ma S, Downard KM, Wong JW. FluClass: A novel algorithm and approach to score and visualize the phylogeny of the influenza virus using mass spectrometry. Anal Chim Acta 2015;895:54-61. [DOI: 10.1016/j.aca.2015.09.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 07/13/2015] [Revised: 08/29/2015] [Accepted: 09/03/2015] [Indexed: 10/23/2022]

Building high-quality assay libraries for targeted analysis of SWATH MS data. Nat Protoc 2015;10:426-41. [PMID: 25675208 DOI: 10.1038/nprot.2015.015] [Citation(s) in RCA: 238] [Impact Index Per Article: 23.8] [Indexed: 12/18/2022]

An Advanced Partial Discharge Recognition Strategy of Power Cable. Journal of Electrical and Computer Engineering 2015. [DOI: 10.1155/2015/174538] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/17/2022]

Muntel J, Boswell SA, Tang S, Ahmed S, Wapinski I, Foley G, Steen H, Springer M. Abundance-based classifier for the prediction of mass spectrometric peptide detectability upon enrichment (PPA). Mol Cell Proteomics 2014;14:430-40. [PMID: 25473088 DOI: 10.1074/mcp.m114.044321] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Indexed: 11/06/2022] Open

Mohammed Y, Domański D, Jackson AM, Smith DS, Deelder AM, Palmblad M, Borchers CH. PeptidePicker: A scientific workflow with web interface for selecting appropriate peptides for targeted proteomics experiments. J Proteomics 2014;106:151-61. [DOI: 10.1016/j.jprot.2014.04.018] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Received: 01/16/2014] [Revised: 04/08/2014] [Accepted: 04/10/2014] [Indexed: 01/08/2023]

Schliekelman P, Liu S. Quantifying the effect of competition for detection between coeluting peptides on detection probabilities in mass-spectrometry-based proteomics. J Proteome Res 2013;13:348-61. [PMID: 24313442 DOI: 10.1021/pr400034z] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Indexed: 02/06/2023]

Liebler DC, Zimmerman LJ. Targeted quantitation of proteins by mass spectrometry. Biochemistry 2013;52:3797-806. [PMID: 23517332 PMCID: PMC3674507 DOI: 10.1021/bi400110b] [Citation(s) in RCA: 256] [Impact Index Per Article: 21.3] [Indexed: 12/13/2022]

Zerck A, Nordhoff E, Lehrach H, Reinert K. Optimal precursor ion selection for LC-MALDI MS/MS. BMC Bioinformatics 2013;14:56. [PMID: 23418672 PMCID: PMC3651328 DOI: 10.1186/1471-2105-14-56] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 07/20/2012] [Accepted: 01/23/2013] [Indexed: 12/30/2022] Open

Methods and Progress of Mass Spectrometry-based Selected Reaction Monitoring. Prog Biochem Biophys 2012. [DOI: 10.3724/sp.j.1206.2012.00009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]

Yadav AK, Kumar D, Dash D. Learning from decoys to improve the sensitivity and specificity of proteomics database search results. PLoS One 2012. [PMID: 23189209 PMCID: PMC3506577 DOI: 10.1371/journal.pone.0050651] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Indexed: 01/07/2023] Open

Li YF, Radivojac P. Computational approaches to protein inference in shotgun proteomics. BMC Bioinformatics 2012;13 Suppl 16:S4. [PMID: 23176300 PMCID: PMC3489551 DOI: 10.1186/1471-2105-13-s16-s4] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Indexed: 11/10/2022] Open

Christin C, Hoefsloot HCJ, Smilde AK, Hoekman B, Suits F, Bischoff R, Horvatovich P. A critical assessment of feature selection methods for biomarker discovery in clinical proteomics. Mol Cell Proteomics 2012;12:263-76. [PMID: 23115301 DOI: 10.1074/mcp.m112.022566] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Indexed: 02/03/2023] Open

Abstract

In this paper, we compare the performance of six different feature selection methods for LC-MS-based proteomics and metabolomics biomarker discovery: the t test, the Mann-Whitney-Wilcoxon test (mww test), nearest shrunken centroid (NSC), linear support vector machine-recursive feature elimination (SVM-RFE), principal component discriminant analysis (PCDA), and partial least squares discriminant analysis (PLSDA). The methods were compared using human urine and porcine cerebrospinal fluid samples that were spiked with a range of peptides at different concentration levels. The ideal feature selection method should select the complete list of discriminating features that are related to the spiked peptides without selecting unrelated features. Whereas many studies have to rely on classification error to judge the reliability of the selected biomarker candidates, we assessed the accuracy of selection directly from the list of spiked peptides. The feature selection methods were applied to data sets with different sample sizes and extents of sample class separation determined by the concentration level of spiked compounds. For each feature selection method and data set, the performance for selecting a set of features related to spiked compounds was assessed using the harmonic mean of the recall and the precision (f-score) and the geometric mean of the recall and the true negative rate (g-score). We conclude that the univariate t test and the mww test with multiple testing corrections are not applicable to data sets with small sample sizes (n = 6), but their performance improves markedly with increasing sample size up to a point (n > 12) at which they outperform the other methods. PCDA and PLSDA select small feature sets with high precision but miss many true positive features related to the spiked peptides. NSC strikes a reasonable compromise between recall and precision for all data sets independent of spiking level and number of samples.
Linear SVM-RFE performs poorly for selecting features related to the spiked compounds, even though the classification error is relatively low.
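
The two selection scores named in the abstract above can be written out directly. The following is a minimal Python sketch, not the authors' code: the function name `selection_scores`, its arguments, and the toy feature indices are illustrative assumptions. Only the definitions come from the abstract (f-score as the harmonic mean of recall and precision, g-score as the geometric mean of recall and true negative rate).

```python
from math import sqrt


def selection_scores(selected, true_features, all_features):
    """Return (f_score, g_score) for one feature selection result.

    selected      -- features picked by the selection method (hypothetical input)
    true_features -- features known to be related to the spiked peptides
    all_features  -- every feature in the data set
    """
    selected = set(selected)
    true_features = set(true_features)
    all_features = set(all_features)

    tp = len(selected & true_features)                 # correctly selected
    fp = len(selected - true_features)                 # selected but unrelated
    fn = len(true_features - selected)                 # related but missed
    tn = len(all_features - selected - true_features)  # correctly left out

    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    true_negative_rate = tn / (tn + fp) if (tn + fp) else 0.0

    # f-score: harmonic mean of recall and precision
    if precision + recall:
        f_score = 2 * precision * recall / (precision + recall)
    else:
        f_score = 0.0

    # g-score: geometric mean of recall and true negative rate
    g_score = sqrt(recall * true_negative_rate)
    return f_score, g_score


if __name__ == "__main__":
    # Toy example: 10 features, 4 truly related, 4 selected (3 of them correct).
    f, g = selection_scores({0, 1, 2, 7}, {0, 1, 2, 3}, range(10))
    print(f"f-score = {f:.3f}, g-score = {g:.3f}")
```

Because the g-score uses the true negative rate, it depends on the total number of features, whereas the f-score depends only on the selected and the truly related features.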

Bereman MS, MacLean B, Tomazela DM, Liebler DC, MacCoss MJ. The development of selected reaction monitoring methods for targeted proteomics via empirical refinement. Proteomics 2012;12:1134-41. [PMID: 22577014 DOI: 10.1002/pmic.201200042] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.7] [Indexed: 11/11/2022]

Rafalko A, Dai S, Hancock WS, Karger BL, Hincapie M. Development of a Chip/Chip/SRM platform using digital chip isoelectric focusing and LC-Chip mass spectrometry for enrichment and quantitation of low abundance protein biomarkers in human plasma. J Proteome Res 2012;11:808-17. [PMID: 22098410 PMCID: PMC3656385 DOI: 10.1021/pr2006704] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Indexed: 12/30/2022]

Abstract

Protein biomarkers are critical for diagnosis, prognosis, and treatment of disease. The transition from protein biomarker discovery to verification can be a rate limiting step in clinical development of new diagnostics. Liquid chromatography-selected reaction monitoring mass spectrometry (LC-SRM MS) is becoming an important tool for biomarker verification studies in highly complex biological samples. Analyte enrichment or sample fractionation is often necessary to reduce sample complexity and improve sensitivity of SRM for quantitation of clinically relevant biomarker candidates present at the low ng/mL range in blood. In this paper, we describe an alternative method for sample preparation for LC-SRM MS, which does not rely on availability of antibodies. This new platform is based on selective enrichment of proteotypic peptides from complex biological peptide mixtures via isoelectric focusing (IEF) on a digital ProteomeChip (dPC) for SRM quantitation using a triple quadrupole (QQQ) instrument with an LC-Chip (Chip/Chip/SRM). To demonstrate the value of this approach, the optimization of the Chip/Chip/SRM platform was performed using prostate specific antigen (PSA) added to female plasma as a model system. The combination of immunodepletion of albumin and IgG with peptide fractionation on the dPC, followed by SRM analysis, resulted in a limit of quantitation of PSA added to female plasma at the level of ∼1-2.5 ng/mL with a CV of ∼13%. The optimized platform was applied to measure levels of PSA in plasma of a small cohort of male patients with prostate cancer (PCa) and healthy matched controls with concentrations ranging from 1.5 to 25 ng/mL. A good correlation (r² = 0.9459) was observed between standard clinical ELISA tests and the SRM-based assay. Our data demonstrate that the combination of IEF on the dPC and SRM (Chip/Chip/SRM) can be successfully applied for verification of low abundance protein biomarkers in complex samples.

Noble WS, MacCoss MJ. Computational and statistical analysis of protein mass spectrometry data. PLoS Comput Biol 2012;8:e1002296. [PMID: 22291580 PMCID: PMC3266873 DOI: 10.1371/journal.pcbi.1002296] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Indexed: 12/20/2022] Open

Cannon WR, Rawlins MM, Baxter DJ, Callister SJ, Lipton MS, Bryant DA. Large improvements in MS/MS-based peptide identification rates using a hybrid analysis. J Proteome Res 2011;10:2306-17. [PMID: 21391700 DOI: 10.1021/pr101130b] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Indexed: 01/16/2023]

Gallien S, Duriez E, Domon B. Selected reaction monitoring applied to proteomics. Journal of Mass Spectrometry 2011;46:298-312. [PMID: 21394846 DOI: 10.1002/jms.1895] [Citation(s) in RCA: 202] [Impact Index Per Article: 14.4] [Indexed: 05/08/2023]

Alves G, Ogurtsov AY, Yu YK. Assigning statistical significance to proteotypic peptides via database searches. J Proteomics 2011;74:199-211. [PMID: 21055489 PMCID: PMC3186061 DOI: 10.1016/j.jprot.2010.10.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Received: 05/26/2010] [Revised: 10/18/2010] [Accepted: 10/21/2010] [Indexed: 11/19/2022]

Liu C, Li H. In silico prediction of post-translational modifications. Methods Mol Biol 2011;760:325-340. [PMID: 21780006 DOI: 10.1007/978-1-61779-176-5_20] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Indexed: 05/31/2023]

Li YF, Arnold RJ, Tang H, Radivojac P. The importance of peptide detectability for protein identification, quantification, and experiment design in MS/MS proteomics. J Proteome Res 2010;9:6288-97. [PMID: 21067214 PMCID: PMC3006185 DOI: 10.1021/pr1005586] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Indexed: 11/28/2022]

Shah AR, Agarwal K, Baker ES, Singhal M, Mayampurath AM, Ibrahim YM, Kangas LJ, Monroe ME, Zhao R, Belov ME, Anderson GA, Smith RD. Machine learning based prediction for peptide drift times in ion mobility spectrometry. Bioinformatics 2010;26:1601-7. [PMID: 20495001 PMCID: PMC2913656 DOI: 10.1093/bioinformatics/btq245] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Received: 01/06/2010] [Revised: 04/18/2010] [Accepted: 05/02/2010] [Indexed: 11/14/2022] Open

Hewel JA, Liu J, Onishi K, Fong V, Chandran S, Olsen JB, Pogoutse O, Schutkowski M, Wenschuh H, Winkler DFH, Eckler L, Zandstra PW, Emili A. Synthetic peptide arrays for pathway-level protein monitoring by liquid chromatography-tandem mass spectrometry. Mol Cell Proteomics 2010;9:2460-73. [PMID: 20467045 DOI: 10.1074/mcp.m900456-mcp200] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Indexed: 12/20/2022] Open

Liang G, Zhao W. Using factor analysis scales of generalized amino acid information for prediction and characteristic analysis of β-turns in proteins based on a support vector machine model. Sci China Chem 2010. [DOI: 10.1007/s11426-010-0165-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/19/2022]

Advance of Peptide Detectability Prediction on Mass Spectrometry Platform in Proteomics. Chinese Journal of Analytical Chemistry 2010. [DOI: 10.3724/sp.j.1096.2010.00286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]

Riffle M, Eng JK. Proteomics data repositories. Proteomics 2010;9:4653-63. [PMID: 19795424 DOI: 10.1002/pmic.200900216] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Indexed: 11/07/2022]

Mujezinovic N, Schneider G, Wildpaner M, Mechtler K, Eisenhaber F. Reducing the haystack to find the needle: improved protein identification after fast elimination of non-interpretable peptide MS/MS spectra and noise reduction. BMC Genomics 2010;11 Suppl 1:S13. [PMID: 20158870 PMCID: PMC2822527 DOI: 10.1186/1471-2164-11-s1-s13] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Indexed: 11/18/2022] Open

Xu CM, Zhang JY, Liu H, Sun HC, Zhu YP, Xie HW. Advance of Peptide Detectability Prediction on Mass Spectrometry Platform in Proteomics. Chinese Journal of Analytical Chemistry 2010. [DOI: 10.1016/s1872-2040(09)60023-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 11/30/2022]

Cham Mead JA, Bianco L, Bessant C. Free computational resources for designing selected reaction monitoring transitions. Proteomics 2010;10:1106-26. [DOI: 10.1002/pmic.200900396] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Indexed: 11/08/2022]

Fusaro VA, Mani DR, Mesirov JP, Carr SA. Prediction of high-responding peptides for targeted protein assays by mass spectrometry. Nat Biotechnol 2009;27:190-8. [PMID: 19169245 PMCID: PMC2753399 DOI: 10.1038/nbt.1524] [Citation(s) in RCA: 232] [Impact Index Per Article: 14.5] [Received: 10/17/2008] [Accepted: 01/03/2009] [Indexed: 12/21/2022]