Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Serang O. A review of statistical methods for protein identification using tandem mass spectrometry. Stat Interface 2012;5:3-20. [PMID: 22833779 PMCID: PMC3402235 DOI: 10.4310/sii.2012.v5.n1.a2] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]

For:	Serang O. A review of statistical methods for protein identification using tandem mass spectrometry. Stat Interface 2012;5:3-20. [PMID: 22833779 PMCID: PMC3402235 DOI: 10.4310/sii.2012.v5.n1.a2] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]

Number

Cited by Other Article(s)

Lou R, Shui W. Acquisition and Analysis of DIA-Based Proteomic Data: A Comprehensive Survey in 2023. Mol Cell Proteomics 2024;23:100712. [PMID: 38182042 PMCID: PMC10847697 DOI: 10.1016/j.mcpro.2024.100712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/27/2023] [Accepted: 01/02/2024] [Indexed: 01/07/2024] Open

Miller RM, Smith LM. Overview and considerations in bottom-up proteomics. Analyst 2023;148:475-486. [PMID: 36383138 PMCID: PMC9898146 DOI: 10.1039/d2an01246d] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Grabowsky ER, Saviola AJ, Alvarado-Díaz J, Mascareñas AQ, Hansen KC, Yates JR, Mackessy SP. Montane Rattlesnakes in México: Venoms of Crotalus tancitarensis and Related Species within the Crotalus intermedius Group. Toxins (Basel) 2023;15:72. [PMID: 36668891 PMCID: PMC9867100 DOI: 10.3390/toxins15010072] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 01/04/2023] [Accepted: 01/10/2023] [Indexed: 01/15/2023] Open

Abstract

The Crotalus intermedius group is a clade of rattlesnakes consisting of several species adapted to a high elevation habitat, primarily in México. Crotalus tancitarensis was previously classified as C. intermedius, until individuals occurring on Cerro Tancítaro in Michoacán, México, were reevaluated and classified as a new species (C. tancitarensis) based on scale pattern and geographic location. This study aimed to characterize the venom of C. tancitarensis and compare the venom profile to those of other species within the Crotalus intermedius group using gel electrophoresis, biochemical assays, reverse-phase high performance liquid chromatography, mass spectrometry, and lethal toxicity (LD50) assays. Results show that the venom profiles of species within the Crotalus intermedius group are similar, but with distinct differences in phospholipase A2 (PLA2), metalloproteinase PI (SVMP PI), and kallikrein-like serine proteinase (SVSP) activity and relative abundance. Proteomic analysis indicated that the highland forms produce venoms with 50-60 protein isoforms and a composition typical of type I rattlesnake venoms (abundant SVMPs, lack of presynaptic PLA2-based neurotoxins), as well as a diversity of typical Crotalus venom components such as serine proteinases, PLA2s, C-type lectins, and less abundant toxins (LAAOs, CRiSPs, etc.). The overall venom profile of C. tancitarensis appears most similar to C. transversus, which is consistent with a previous mitochondrial DNA analysis of the Crotalus intermedius group. These rattlesnakes of the Mexican highlands represent a radiation of high elevation specialists, and in spite of divergence of species in these Sky Island habitats, venom composition of species analyzed here has remained relatively conserved. The majority of protein family isoforms are conserved in all members of the clade, and as seen in other more broadly distributed rattlesnake species, differences in their venoms are largely due to relative concentrations of specific components.

Collapse

Cunsolo V, Di Francesco A, Pittalà MGG, Saletti R, Foti S. The TriMet_DB: A Manually Curated Database of the Metabolic Proteins of Triticum aestivum. Nutrients 2022;14:nu14245377. [PMID: 36558536 PMCID: PMC9781733 DOI: 10.3390/nu14245377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/07/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022] Open

Reanalysis of ProteomicsDB Using an Accurate, Sensitive, and Scalable False Discovery Rate Estimation Approach for Protein Groups. Mol Cell Proteomics 2022;21:100437. [PMID: 36328188 PMCID: PMC9718969 DOI: 10.1016/j.mcpro.2022.100437] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 10/16/2022] [Accepted: 10/28/2022] [Indexed: 11/07/2022] Open

Abstract

Estimating false discovery rates (FDRs) of protein identification continues to be an important topic in mass spectrometry-based proteomics, particularly when analyzing very large datasets. One performant method for this purpose is the Picked Protein FDR approach which is based on a target-decoy competition strategy on the protein level that ensures that FDRs scale to large datasets. Here, we present an extension to this method that can also deal with protein groups, that is, proteins that share common peptides such as protein isoforms of the same gene. To obtain well-calibrated FDR estimates that preserve protein identification sensitivity, we introduce two novel ideas. First, the picked group target-decoy and second, the rescued subset grouping strategies. Using entrapment searches and simulated data for validation, we demonstrate that the new Picked Protein Group FDR method produces accurate protein group-level FDR estimates regardless of the size of the data set. The validation analysis also uncovered that applying the commonly used Occam's razor principle leads to anticonservative FDR estimates for large datasets. This is not the case for the Picked Protein Group FDR method. Reanalysis of deep proteomes of 29 human tissues showed that the new method identified up to 4% more protein groups than MaxQuant. Applying the method to the reanalysis of the entire human section of ProteomicsDB led to the identification of 18,000 protein groups at 1% protein group-level FDR. The analysis also showed that about 1250 genes were represented by ≥2 identified protein groups. To make the method accessible to the proteomics community, we provide a software tool including a graphical user interface that enables merging results from multiple MaxQuant searches into a single list of identified and quantified protein groups.

Collapse

Miller RM, Jordan BT, Mehlferber MM, Jeffery ED, Chatzipantsiou C, Kaur S, Millikin RJ, Dai Y, Tiberi S, Castaldi PJ, Shortreed MR, Luckey CJ, Conesa A, Smith LM, Deslattes Mays A, Sheynkman GM. Enhanced protein isoform characterization through long-read proteogenomics. Genome Biol 2022;23:69. [PMID: 35241129 PMCID: PMC8892804 DOI: 10.1186/s13059-022-02624-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 02/02/2022] [Indexed: 02/04/2023] Open

Affiliation(s)

Rachel M. Miller grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
Ben T. Jordan grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA
Madison M. Mehlferber grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA ,3grid.27755.320000 0000 9136 933XDepartment of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA USA
Erin D. Jeffery grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA
Christina Chatzipantsiou Lifebit Biotech LTD., London, UK
Simi Kaur grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
Robert J. Millikin grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
Yunxiang Dai grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
Simone Tiberi grid.7400.30000 0004 1937 0650Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland ,6grid.7400.30000 0004 1937 0650Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
Peter J. Castaldi grid.62560.370000 0004 0378 8294Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA USA ,8grid.62560.370000 0004 0378 8294Division of General Medicine and Primary Care, Brigham and Women’s Hospital, Boston, MA USA
Michael R. Shortreed grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
Chance John Luckey grid.27755.320000 0000 9136 933XDepartment of Pathology, University of Virginia, Charlottesville, VA USA
Ana Conesa grid.4711.30000 0001 2183 4846Institute for Integrative Systems Biology, Spanish National Research Council (CSIC), Paterna, Spain ,11grid.15276.370000 0004 1936 8091Microbiology and Cell Science Department, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL USA
Lloyd M. Smith grid.14003.360000 0001 2167 3675Department of Chemistry, University of Wisconsin-Madison, Madison, WI USA
Anne Deslattes Mays grid.420089.70000 0000 9635 8082 Office of Data Science and Sharing, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Rockville, MD USA
Gloria M. Sheynkman grid.27755.320000 0000 9136 933XDepartment of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA USA ,13grid.27755.320000 0000 9136 933XCenter for Public Health Genomics, University of Virginia, Charlottesville, VA USA ,14grid.27755.320000 0000 9136 933XUVA Cancer Center, University of Virginia, Charlottesville, VA USA

Collapse

Simopoulos CMA, Figeys D, Lavallée-Adam M. Novel Bioinformatics Strategies Driving Dynamic Metaproteomic Studies. Methods Mol Biol 2022;2456:319-338. [PMID: 35612752 DOI: 10.1007/978-1-0716-2124-0_22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Van Den Bossche T, Kunath BJ, Schallert K, Schäpe SS, Abraham PE, Armengaud J, Arntzen MØ, Bassignani A, Benndorf D, Fuchs S, Giannone RJ, Griffin TJ, Hagen LH, Halder R, Henry C, Hettich RL, Heyer R, Jagtap P, Jehmlich N, Jensen M, Juste C, Kleiner M, Langella O, Lehmann T, Leith E, May P, Mesuere B, Miotello G, Peters SL, Pible O, Queiros PT, Reichl U, Renard BY, Schiebenhoefer H, Sczyrba A, Tanca A, Trappe K, Trezzi JP, Uzzau S, Verschaffelt P, von Bergen M, Wilmes P, Wolf M, Martens L, Muth T. Critical Assessment of MetaProteome Investigation (CAMPI): a multi-laboratory comparison of established workflows. Nat Commun 2021;12:7305. [PMID: 34911965 PMCID: PMC8674281 DOI: 10.1038/s41467-021-27542-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 11/24/2021] [Indexed: 12/17/2022] Open

Affiliation(s)

Tim Van Den Bossche VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
Benoit J Kunath Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Kay Schallert Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Stephanie S Schäpe Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
Paul E Abraham Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Jean Armengaud Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
Magnus Ø Arntzen Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
Ariane Bassignani INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
Dirk Benndorf Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany Microbiology, Department of Applied Biosciences and Process Technology, Anhalt University of Applied Sciences, Köthen, Germany Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
Stephan Fuchs Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
Richard J Giannone Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Timothy J Griffin Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
Live H Hagen Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
Rashi Halder Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Céline Henry INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
Robert L Hettich Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Robert Heyer Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Pratik Jagtap Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
Nico Jehmlich Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
Marlene Jensen Department of Plant & Microbial Biology, North Carolina State University, Raleigh, USA
Catherine Juste INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
Manuel Kleiner Department of Plant & Microbial Biology, North Carolina State University, Raleigh, USA
Olivier Langella Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
Theresa Lehmann Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Emma Leith Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
Patrick May Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Bart Mesuere VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Guylaine Miotello Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
Samantha L Peters Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Olivier Pible Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
Pedro T Queiros Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Udo Reichl Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
Bernhard Y Renard Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany Data Analytics and Computational Statistics, Hasso-Plattner-Institute, Faculty of Digital Engineering, University of Potsdam, Potsdam, Germany
Henning Schiebenhoefer Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany Data Analytics and Computational Statistics, Hasso-Plattner-Institute, Faculty of Digital Engineering, University of Potsdam, Potsdam, Germany
Alexander Sczyrba Faculty of Technology, Bielefeld University, Bielefeld, Germany
Alessandro Tanca Department of Biomedical Sciences, University of Sassari, Sassari, Italy
Kathrin Trappe Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
Jean-Pierre Trezzi Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg Integrated Biobank of Luxembourg, Luxembourg Institute of Health, 1, rue Louis Rech, L-3555, Dudelange, Luxembourg
Sergio Uzzau Department of Biomedical Sciences, University of Sassari, Sassari, Italy
Pieter Verschaffelt VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Martin von Bergen Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
Paul Wilmes Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, 6 avenue du Swing, L-4367, Belvaux, Luxembourg
Maximilian Wolf Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Lennart Martens VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium. Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium.
Thilo Muth Section eScience (S.3), Federal Institute for Materials Research and Testing, Berlin, Germany

Collapse

Van Den Bossche T, Kunath BJ, Schallert K, Schäpe SS, Abraham PE, Armengaud J, Arntzen MØ, Bassignani A, Benndorf D, Fuchs S, Giannone RJ, Griffin TJ, Hagen LH, Halder R, Henry C, Hettich RL, Heyer R, Jagtap P, Jehmlich N, Jensen M, Juste C, Kleiner M, Langella O, Lehmann T, Leith E, May P, Mesuere B, Miotello G, Peters SL, Pible O, Queiros PT, Reichl U, Renard BY, Schiebenhoefer H, Sczyrba A, Tanca A, Trappe K, Trezzi JP, Uzzau S, Verschaffelt P, von Bergen M, Wilmes P, Wolf M, Martens L, Muth T. Critical Assessment of MetaProteome Investigation (CAMPI): a multi-laboratory comparison of established workflows. Nat Commun 2021;12:7305. [PMID: 34911965 DOI: 10.1101/2021.03.05.433915] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 11/24/2021] [Indexed: 05/21/2023] Open

Affiliation(s)

Tim Van Den Bossche VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
Benoit J Kunath Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Kay Schallert Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Stephanie S Schäpe Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
Paul E Abraham Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Jean Armengaud Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
Magnus Ø Arntzen Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
Ariane Bassignani INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
Dirk Benndorf Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany Microbiology, Department of Applied Biosciences and Process Technology, Anhalt University of Applied Sciences, Köthen, Germany Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
Stephan Fuchs Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
Richard J Giannone Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Timothy J Griffin Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
Live H Hagen Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
Rashi Halder Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Céline Henry INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
Robert L Hettich Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Robert Heyer Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Pratik Jagtap Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
Nico Jehmlich Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
Marlene Jensen Department of Plant & Microbial Biology, North Carolina State University, Raleigh, USA
Catherine Juste INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
Manuel Kleiner Department of Plant & Microbial Biology, North Carolina State University, Raleigh, USA
Olivier Langella Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
Theresa Lehmann Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Emma Leith Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
Patrick May Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Bart Mesuere VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Guylaine Miotello Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
Samantha L Peters Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
Olivier Pible Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
Pedro T Queiros Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
Udo Reichl Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
Bernhard Y Renard Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany Data Analytics and Computational Statistics, Hasso-Plattner-Institute, Faculty of Digital Engineering, University of Potsdam, Potsdam, Germany
Henning Schiebenhoefer Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany Data Analytics and Computational Statistics, Hasso-Plattner-Institute, Faculty of Digital Engineering, University of Potsdam, Potsdam, Germany
Alexander Sczyrba Faculty of Technology, Bielefeld University, Bielefeld, Germany
Alessandro Tanca Department of Biomedical Sciences, University of Sassari, Sassari, Italy
Kathrin Trappe Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
Jean-Pierre Trezzi Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg Integrated Biobank of Luxembourg, Luxembourg Institute of Health, 1, rue Louis Rech, L-3555, Dudelange, Luxembourg
Sergio Uzzau Department of Biomedical Sciences, University of Sassari, Sassari, Italy
Pieter Verschaffelt VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
Martin von Bergen Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
Paul Wilmes Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, 6 avenue du Swing, L-4367, Belvaux, Luxembourg
Maximilian Wolf Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
Lennart Martens VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium. Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium.
Thilo Muth Section eScience (S.3), Federal Institute for Materials Research and Testing, Berlin, Germany

Collapse

Cantrell LS, Schey KL. Data-Independent Acquisition Mass Spectrometry of the Human Lens Enhances Spatiotemporal Measurement of Fiber Cell Aging. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021;32:2755-2765. [PMID: 34705440 PMCID: PMC9685647 DOI: 10.1021/jasms.1c00193] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Kirchner M, Deng H, Xu Y. Heterogeneity in proline hydroxylation of fibrillar collagens observed by mass spectrometry. PLoS One 2021;16:e0250544. [PMID: 34464391 PMCID: PMC8407550 DOI: 10.1371/journal.pone.0250544] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 06/28/2021] [Indexed: 01/22/2023] Open

Abstract

Collagen is the major protein in the extracellular matrix and plays vital roles in tissue development and function. Collagen is also one of the most processed proteins in its biosynthesis. The most prominent post-translational modification (PTM) of collagen is the hydroxylation of Pro residues in the Y-position of the characteristic (Gly-Xaa-Yaa) repeating amino acid sequence of a collagen triple helix. Recent studies using mass spectrometry (MS) and tandem MS sequencing (MS/MS) have revealed unexpected hydroxylation of Pro residues in the X-positions (X-Hyp). The newly identified X-Hyp residues appear to be highly heterogeneous in location and percent occupancy. In order to understand the dynamic nature of the new X-Hyps and their potential impact on applications of MS and MS/MS for collagen research, we sampled four different collagen samples using standard MS and MS/MS techniques. We found considerable variations in the degree of PTMs of the same collagen from different organisms and/or tissues. The rat tail tendon type I collagen is particularly variable in terms of both over-hydroxylation of Pro in the X-position and under-hydroxylation of Pro in the Y-position. In contrast, only a few unexpected PTMs in collagens type I and type III from human placenta were observed. Some observations are not reproducible between different sequencing efforts of the same sample, presumably due to a low population and/or the unpredictable nature of the ionization process. Additionally, despite the heterogeneous preparation and sourcing, collagen samples from commercial sources do not show elevated variations in PTMs compared to samples prepared from a single tissue and/or organism. These findings will contribute to the growing body of information regarding the PTMs of collagen by MS technology, and culminate to a more comprehensive understanding of the extent and the functional roles of the PTMs of collagen.

Collapse

Rozanova S, Barkovits K, Nikolov M, Schmidt C, Urlaub H, Marcus K. Quantitative Mass Spectrometry-Based Proteomics: An Overview. Methods Mol Biol 2021;2228:85-116. [PMID: 33950486 DOI: 10.1007/978-1-0716-1024-4_8] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Peng Y, Jain S, Li YF, Greguš M, Ivanov AR, Vitek O, Radivojac P. New mixture models for decoy-free false discovery rate estimation in mass spectrometry proteomics. Bioinformatics 2020;36:i745-i753. [PMID: 33381824 DOI: 10.1093/bioinformatics/btaa807] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Abstract

MOTIVATION

Accurate estimation of false discovery rate (FDR) of spectral identification is a central problem in mass spectrometry-based proteomics. Over the past two decades, target-decoy approaches (TDAs) and decoy-free approaches (DFAs) have been widely used to estimate FDR. TDAs use a database of decoy species to faithfully model score distributions of incorrect peptide-spectrum matches (PSMs). DFAs, on the other hand, fit two-component mixture models to learn the parameters of correct and incorrect PSM score distributions. While conceptually straightforward, both approaches lead to problems in practice, particularly in experiments that push instrumentation to the limit and generate low fragmentation-efficiency and low signal-to-noise-ratio spectra.

RESULTS

We introduce a new decoy-free framework for FDR estimation that generalizes present DFAs while exploiting more search data in a manner similar to TDAs. Our approach relies on multi-component mixtures, in which score distributions corresponding to the correct PSMs, best incorrect PSMs and second-best incorrect PSMs are modeled by the skew normal family. We derive EM algorithms to estimate parameters of these distributions from the scores of best and second-best PSMs associated with each experimental spectrum. We evaluate our models on multiple proteomics datasets and a HeLa cell digest case study consisting of more than a million spectra in total. We provide evidence of improved performance over existing DFAs and improved stability and speed over TDAs without any performance degradation. We propose that the new strategy has the potential to extend beyond peptide identification and reduce the need for TDA on all analytical platforms.

AVAILABILITYAND IMPLEMENTATION

https://github.com/shawn-peng/FDR-estimation.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Couté Y, Bruley C, Burger T. Beyond Target-Decoy Competition: Stable Validation of Peptide and Protein Identifications in Mass Spectrometry-Based Discovery Proteomics. Anal Chem 2020;92:14898-14906. [PMID: 32970414 DOI: 10.1021/acs.analchem.0c00328] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Aitekenov S, Gaipov A, Bukasov R. Review: Detection and quantification of proteins in human urine. Talanta 2020;223:121718. [PMID: 33303164 PMCID: PMC7554478 DOI: 10.1016/j.talanta.2020.121718] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 09/23/2020] [Accepted: 09/26/2020] [Indexed: 12/31/2022]

Abstract

Extensive medical research showed that patients, with high protein concentration in urine, have various kinds of kidney diseases, referred to as proteinuria. Urinary protein biomarkers are useful for diagnosis of many health conditions – kidney and cardio vascular diseases, cancers, diabetes, infections. This review focuses on the instrumental quantification (electrophoresis, chromatography, immunoassays, mass spectrometry, fluorescence spectroscopy, the infrared spectroscopy, and Raman spectroscopy) of proteins (the most of all albumin) in human urine matrix. Different techniques provide unique information on what constituents of the urine are. Due to complex nature of urine, a separation step by electrophoresis or chromatography are often used for proteomics study of urine. Mass spectrometry is a powerful tool for the discovery and the analysis of biomarkers in urine, however, costs of the analysis are high, especially for quantitative analysis. Immunoassays, which often come with fluorescence detection, are major qualitative and quantitative tools in clinical analysis. While Infrared and Raman spectroscopies do not give extensive information about urine, they could become important tools for the routine clinical diagnostics of kidney problems, due to rapidness and low-cost. Thus, it is important to review all the applicable techniques and methods related to urine analysis. In this review, a brief overview of each technique's principle is introduced. Where applicable, research papers about protein determination in urine are summarized with the main figures of merits, such as the limit of detection, the detectable range, recovery and accuracy, when available.

•

Urinary protein biomarkers are useful for diagnosis of many conditions: kidney and cardio vascular diseases, cancers.

•

Liquid chromatography – mass spectroscopy is a powerful tool for urine proteomics, but used mostly in science.

•

Immunoassays are widely used in both clinical and bio-analytical laboratories.

•

IR and Raman spectroscopies are promising tools for diagnostics of urine due to low-cost and rapidness.

Collapse

Simopoulos CMA, Ning Z, Zhang X, Li L, Walker K, Lavallée-Adam M, Figeys D. pepFunk: a tool for peptide-centric functional analysis of metaproteomic human gut microbiome studies. Bioinformatics 2020;36:4171-4179. [DOI: 10.1093/bioinformatics/btaa289] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 03/20/2020] [Accepted: 04/27/2020] [Indexed: 12/13/2022] Open

Abstract Abstract Motivation Enzymatic digestion of proteins before mass spectrometry analysis is a key process in metaproteomic workflows. Canonical metaproteomic data processing pipelines typically involve matching spectra produced by the mass spectrometer to a theoretical spectra database, followed by matching the identified peptides back to parent-proteins. However, the nature of enzymatic digestion produces peptides that can be found in multiple proteins due to conservation or chance, presenting difficulties with protein and functional assignment. Results To combat this challenge, we developed pepFunk, a peptide-centric metaproteomic workflow focused on the analysis of human gut microbiome samples. Our workflow includes a curated peptide database annotated with Kyoto Encyclopedia of Genes and Genomes (KEGG) terms and a gene set variation analysis-inspired pathway enrichment adapted for peptide-level data. Analysis using our peptide-centric workflow is fast and highly correlated to a protein-centric analysis, and can identify more enriched KEGG pathways than analysis using protein-level data. Our workflow is open source and available as a web application or source code to be run locally. Availability and implementation pepFunk is available online as a web application at https://shiny.imetalab.ca/pepFunk/ with open-source code available from https://github.com/northomics/pepFunk. Contact dfigeys@uottawa.ca Supplementary information Supplementary data are available at Bioinformatics online. Collapse

Affiliation(s)

Caitlin M A Simopoulos Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Faculty of Medicine, SIMM-University of Ottawa Joint Research Center in Systems and Personalized Pharmacology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
Zhibin Ning Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Faculty of Medicine, SIMM-University of Ottawa Joint Research Center in Systems and Personalized Pharmacology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
Xu Zhang Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Faculty of Medicine, SIMM-University of Ottawa Joint Research Center in Systems and Personalized Pharmacology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
Leyuan Li Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Faculty of Medicine, SIMM-University of Ottawa Joint Research Center in Systems and Personalized Pharmacology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
Krystal Walker Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Faculty of Medicine, SIMM-University of Ottawa Joint Research Center in Systems and Personalized Pharmacology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
Mathieu Lavallée-Adam Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
Daniel Figeys Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Faculty of Medicine, SIMM-University of Ottawa Joint Research Center in Systems and Personalized Pharmacology, University of Ottawa, Ottawa, ON K1H 8M5, Canada Canadian Institute for Advanced Research, Toronto, ON M5G 1M1, Canada

Collapse

Prieto G, Vázquez J. Protein Probability Model for High-Throughput Protein Identification by Mass Spectrometry-Based Proteomics. J Proteome Res 2020;19:1285-1297. [PMID: 32037837 DOI: 10.1021/acs.jproteome.9b00819] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Pfeuffer J, Sachsenberg T, Dijkstra TMH, Serang O, Reinert K, Kohlbacher O. EPIFANY: A Method for Efficient High-Confidence Protein Inference. J Proteome Res 2020;19:1060-1072. [PMID: 31975601 DOI: 10.1021/acs.jproteome.9b00566] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Matthiesen R, Prieto G, Beck HC. Comparing Peptide Spectra Matches Across Search Engines. Methods Mol Biol 2020;2051:133-143. [PMID: 31552627 DOI: 10.1007/978-1-4939-9744-2_5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Inferring Protein-Protein Interaction Networks From Mass Spectrometry-Based Proteomic Approaches: A Mini-Review. Comput Struct Biotechnol J 2019;17:805-811. [PMID: 31316724 PMCID: PMC6611912 DOI: 10.1016/j.csbj.2019.05.007] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Revised: 05/20/2019] [Accepted: 05/26/2019] [Indexed: 01/06/2023] Open

Schiebenhoefer H, Van Den Bossche T, Fuchs S, Renard BY, Muth T, Martens L. Challenges and promise at the interface of metaproteomics and genomics: an overview of recent progress in metaproteogenomic data analysis. Expert Rev Proteomics 2019;16:375-390. [PMID: 31002542 DOI: 10.1080/14789450.2019.1609944] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Devabhaktuni A, Lin S, Zhang L, Swaminathan K, Gonzalez CG, Olsson N, Pearlman SM, Rawson K, Elias JE. TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets. Nat Biotechnol 2019;37:469-479. [PMID: 30936560 PMCID: PMC6447449 DOI: 10.1038/s41587-019-0067-5] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 02/12/2019] [Indexed: 02/06/2023]

Henning J, Tostengard A, Smith R. A Peptide-Level Fully Annotated Data Set for Quantitative Evaluation of Precursor-Aware Mass Spectrometry Data Processing Algorithms. J Proteome Res 2018;18:392-398. [PMID: 30394759 DOI: 10.1021/acs.jproteome.8b00659] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Discrimination and quantification of homologous keratins from goat and sheep with dual protease digestion and PRM assays. J Proteomics 2018;186:38-46. [DOI: 10.1016/j.jprot.2018.07.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 07/03/2018] [Accepted: 07/13/2018] [Indexed: 01/25/2023]

Jarnuczak AF, Albornoz MG, Eyers CE, Grant CM, Hubbard SJ. A quantitative and temporal map of proteostasis during heat shock in Saccharomyces cerevisiae. Mol Omics 2018;14:37-52. [PMID: 29570196 DOI: 10.1039/c7mo00050b] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]

Dowsey AW. The need for statistical contributions to bioinformatics at scale, with illustration to mass spectrometry. STAT MODEL 2017. [DOI: 10.1177/1471082x17708519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Rosenberger G, Bludau I, Schmitt U, Heusel M, Hunter CL, Liu Y, MacCoss MJ, MacLean BX, Nesvizhskii AI, Pedrioli PGA, Reiter L, Röst HL, Tate S, Ting YS, Collins BC, Aebersold R. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses. Nat Methods 2017;14:921-927. [PMID: 28825704 PMCID: PMC5581544 DOI: 10.1038/nmeth.4398] [Citation(s) in RCA: 139] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 07/07/2017] [Indexed: 12/18/2022]

Affiliation(s)

George Rosenberger Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
Isabell Bludau Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD Program in Systems Biology, University of Zurich and ETH Zurich, Zurich, Switzerland
Uwe Schmitt ID Scientific IT Services, ETH Zurich, Zurich, Switzerland
Moritz Heusel Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,PhD program in Molecular and Translational Biomedicine, Competence Center Personalized Medicine (CC-PM), ETH Zurich and University of Zurich, Zurich, Switzerland
Christie L Hunter SCIEX, Redwood City, California, USA
Yansheng Liu Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Michael J MacCoss Department of Genome Sciences, University of Washington, Seattle, Washington, USA
Brendan X MacLean Department of Genome Sciences, University of Washington, Seattle, Washington, USA
Alexey I Nesvizhskii Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA.,Department of Pathology, University of Michigan, Ann Arbor, Michigan, USA
Patrick G A Pedrioli Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Lukas Reiter Biognosys, Schlieren, Switzerland
Hannes L Röst Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Stephen Tate SCIEX, Concord, Ontario, Canada
Ying S Ting Department of Genome Sciences, University of Washington, Seattle, Washington, USA
Ben C Collins Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
Ruedi Aebersold Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland.,Faculty of Science, University of Zurich, Zurich, Switzerland

Collapse

Proteomic differences in amyloid plaques in rapidly progressive and sporadic Alzheimer's disease. Acta Neuropathol 2017;133:933-954. [PMID: 28258398 DOI: 10.1007/s00401-017-1691-0] [Citation(s) in RCA: 128] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Revised: 02/22/2017] [Accepted: 02/26/2017] [Indexed: 12/16/2022]

Zhang B, Pirmoradian M, Zubarev R, Käll L. Covariation of Peptide Abundances Accurately Reflects Protein Concentration Differences. Mol Cell Proteomics 2017;16:936-948. [PMID: 28302922 PMCID: PMC5417831 DOI: 10.1074/mcp.o117.067728] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Revised: 03/13/2017] [Indexed: 12/29/2022] Open

Abstract

Most implementations of mass spectrometry-based proteomics involve enzymatic digestion of proteins, expanding the analysis to multiple proteolytic peptides for each protein. Currently, there is no consensus of how to summarize peptides' abundances to protein concentrations, and such efforts are complicated by the fact that error control normally is applied to the identification process, and do not directly control errors linking peptide abundance measures to protein concentration. Peptides resulting from suboptimal digestion or being partially modified are not representative of the protein concentration. Without a mechanism to remove such unrepresentative peptides, their abundance adversely impacts the estimation of their protein's concentration. Here, we present a relative quantification approach, Diffacto, that applies factor analysis to extract the covariation of peptides' abundances. The method enables a weighted geometrical average summarization and automatic elimination of incoherent peptides. We demonstrate, based on a set of controlled label-free experiments using standard mixtures of proteins, that the covariation structure extracted by the factor analysis accurately reflects protein concentrations. In the 1% peptide-spectrum match-level FDR data set, as many as 11% of the peptides have abundance differences incoherent with the other peptides attributed to the same protein. If not controlled, such contradicting peptide abundance have a severe impact on protein quantifications. When adding the quantities of each protein's three most abundant peptides, we note as many as 14% of the proteins being estimated as having a negative correlation with their actual concentration differences between samples. Diffacto reduced the amount of such obviously incorrectly quantified proteins to 1.6%. Furthermore, by analyzing clinical data sets from two breast cancer studies, our method revealed the persistent proteomic signatures linked to three subtypes of breast cancer. We conclude that Diffacto can facilitate the interpretation and enhance the utility of most types of proteomics data.

Collapse

The M, MacCoss MJ, Noble WS, Käll L. Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2016;27:1719-1727. [PMID: 27572102 PMCID: PMC5059416 DOI: 10.1007/s13361-016-1460-7] [Citation(s) in RCA: 225] [Impact Index Per Article: 28.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Revised: 06/15/2016] [Accepted: 07/20/2016] [Indexed: 05/21/2023]

The M, Tasnim A, Käll L. How to talk about protein-level false discovery rates in shotgun proteomics. Proteomics 2016;16:2461-9. [PMID: 27503675 PMCID: PMC5096025 DOI: 10.1002/pmic.201500431] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Revised: 05/12/2016] [Accepted: 07/20/2016] [Indexed: 12/04/2022]

Riley NM, Bern M, Westphall MS, Coon JJ. Full-Featured Search Algorithm for Negative Electron-Transfer Dissociation. J Proteome Res 2016;15:2768-76. [PMID: 27402189 DOI: 10.1021/acs.jproteome.6b00319] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Muth T, Renard BY, Martens L. Metaproteomic data analysis at a glance: advances in computational microbial community proteomics. Expert Rev Proteomics 2016;13:757-69. [DOI: 10.1080/14789450.2016.1209418] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

McDowell G, Philpott A. New Insights Into the Role of Ubiquitylation of Proteins. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2016;325:35-88. [DOI: 10.1016/bs.ircmb.2016.02.002] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Latosinska A, Vougas K, Makridakis M, Klein J, Mullen W, Abbas M, Stravodimos K, Katafigiotis I, Merseburger AS, Zoidakis J, Mischak H, Vlahou A, Jankowski V. Comparative Analysis of Label-Free and 8-Plex iTRAQ Approach for Quantitative Tissue Proteomic Analysis. PLoS One 2015;10:e0137048. [PMID: 26331617 PMCID: PMC4557910 DOI: 10.1371/journal.pone.0137048] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2015] [Accepted: 08/12/2015] [Indexed: 11/18/2022] Open

Abstract

High resolution proteomics approaches have been successfully utilized for the comprehensive characterization of the cell proteome. However, in the case of quantitative proteomics an open question still remains, which quantification strategy is best suited for identification of biologically relevant changes, especially in clinical specimens. In this study, a thorough comparison of a label-free approach (intensity-based) and 8-plex iTRAQ was conducted as applied to the analysis of tumor tissue samples from non-muscle invasive and muscle-invasive bladder cancer. For the latter, two acquisition strategies were tested including analysis of unfractionated and fractioned iTRAQ-labeled peptides. To reduce variability, aliquots of the same protein extract were used as starting material, whereas to obtain representative results per method further sample processing and MS analysis were conducted according to routinely applied protocols. Considering only multiple-peptide identifications, LC-MS/MS analysis resulted in the identification of 910, 1092 and 332 proteins by label-free, fractionated and unfractionated iTRAQ, respectively. The label-free strategy provided higher protein sequence coverage compared to both iTRAQ experiments. Even though pre-fraction of the iTRAQ labeled peptides allowed for a higher number of identifications, this was not accompanied by a respective increase in the number of differentially expressed changes detected. Validity of the proteomics output related to protein identification and differential expression was determined by comparison to existing data in the field (Protein Atlas and published data on the disease). All methods predicted changes which to a large extent agreed with published data, with label-free providing a higher number of significant changes than iTRAQ. Conclusively, both label-free and iTRAQ (when combined to peptide fractionation) provide high proteome coverage and apparently valid predictions in terms of differential expression, nevertheless label-free provides higher sequence coverage and ultimately detects a higher number of differentially expressed proteins. The risk for receiving false associations still exists, particularly when analyzing highly heterogeneous biological samples, raising the need for the analysis of higher sample numbers and/or application of adjustment for multiple testing.

Collapse

Filip S, Vougas K, Zoidakis J, Latosinska A, Mullen W, Spasovski G, Mischak H, Vlahou A, Jankowski J. Comparison of Depletion Strategies for the Enrichment of Low-Abundance Proteins in Urine. PLoS One 2015. [PMID: 26208298 PMCID: PMC4514849 DOI: 10.1371/journal.pone.0133773] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Abstract

Proteome analysis of complex biological samples for biomarker identification remains challenging, among others due to the extended range of protein concentrations. High-abundance proteins like albumin or IgG of plasma and urine, may interfere with the detection of potential disease biomarkers. Currently, several options are available for the depletion of abundant proteins in plasma. However, the applicability of these methods in urine has not been thoroughly investigated. In this study, we compared different, commercially available immunodepletion and ion-exchange based approaches on urine samples from both healthy subjects and CKD patients, for their reproducibility and efficiency in protein depletion. A starting urine volume of 500 μL was used to simulate conditions of a multi-institutional biomarker discovery study. All depletion approaches showed satisfactory reproducibility (n=5) in protein identification as well as protein abundance. Comparison of the depletion efficiency between the unfractionated and fractionated samples and the different depletion strategies, showed efficient depletion in all cases, with the exception of the ion-exchange kit. The depletion efficiency was found slightly higher in normal than in CKD samples and normal samples yielded more protein identifications than CKD samples when using both initial as well as corresponding depleted fractions. Along these lines, decrease in the amount of albumin and other targets as applicable, following depletion, was observed. Nevertheless, these depletion strategies did not yield a higher number of identifications in neither the urine from normal nor CKD patients. Collectively, when analyzing urine in the context of CKD biomarker identification, no added value of depletion strategies can be observed and analysis of unfractionated starting urine appears to be preferable.

Collapse

Serang O. A Fast Numerical Method for Max-Convolution and the Application to Efficient Max-Product Inference in Bayesian Networks. J Comput Biol 2015;22:770-83. [PMID: 26161499 DOI: 10.1089/cmb.2015.0013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Webb-Robertson BJM, Matzke MM, Datta S, Payne SH, Kang J, Bramer LM, Nicora CD, Shukla AK, Metz TO, Rodland KD, Smith RD, Tardiff MF, McDermott JE, Pounds JG, Waters KM. Bayesian proteoform modeling improves protein quantification of global proteomic measurements. Mol Cell Proteomics 2015;13:3639-46. [PMID: 25433089 DOI: 10.1074/mcp.m113.030932] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Abstract

As the capability of mass spectrometry-based proteomics has matured, tens of thousands of peptides can be measured simultaneously, which has the benefit of offering a systems view of protein expression. However, a major challenge is that, with an increase in throughput, protein quantification estimation from the native measured peptides has become a computational task. A limitation to existing computationally driven protein quantification methods is that most ignore protein variation, such as alternate splicing of the RNA transcript and post-translational modifications or other possible proteoforms, which will affect a significant fraction of the proteome. The consequence of this assumption is that statistical inference at the protein level, and consequently downstream analyses, such as network and pathway modeling, have only limited power for biomarker discovery. Here, we describe a Bayesian Proteoform Quantification model (BP-Quant)(1) that uses statistically derived peptides signatures to identify peptides that are outside the dominant pattern or the existence of multiple overexpressed patterns to improve relative protein abundance estimates. It is a research-driven approach that utilizes the objectives of the experiment, defined in the context of a standard statistical hypothesis, to identify a set of peptides exhibiting similar statistical behavior relating to a protein. This approach infers that changes in relative protein abundance can be used as a surrogate for changes in function, without necessarily taking into account the effect of differential post-translational modifications, processing, or splicing in altering protein function. We verify the approach using a dilution study from mouse plasma samples and demonstrate that BP-Quant achieves similar accuracy as the current state-of-the-art methods at proteoform identification with significantly better specificity. BP-Quant is available as a MatLab® and R packages.

Collapse

Sikdar S, Gill R, Datta S. Improving protein identification from tandem mass spectrometry data by one-step methods and integrating data from other platforms. Brief Bioinform 2015;17:262-9. [PMID: 26141827 DOI: 10.1093/bib/bbv043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Indexed: 01/28/2023] Open

Savitski MM, Wilhelm M, Hahne H, Kuster B, Bantscheff M. A Scalable Approach for Protein False Discovery Rate Estimation in Large Proteomic Data Sets. Mol Cell Proteomics 2015;14:2394-404. [PMID: 25987413 DOI: 10.1074/mcp.m114.046995] [Citation(s) in RCA: 283] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Indexed: 02/06/2023] Open

Abstract

Calculating the number of confidently identified proteins and estimating false discovery rate (FDR) is a challenge when analyzing very large proteomic data sets such as entire human proteomes. Biological and technical heterogeneity in proteomic experiments further add to the challenge and there are strong differences in opinion regarding the conceptual validity of a protein FDR and no consensus regarding the methodology for protein FDR determination. There are also limitations inherent to the widely used classic target-decoy strategy that particularly show when analyzing very large data sets and that lead to a strong over-representation of decoy identifications. In this study, we investigated the merits of the classic, as well as a novel target-decoy-based protein FDR estimation approach, taking advantage of a heterogeneous data collection comprised of ∼19,000 LC-MS/MS runs deposited in ProteomicsDB (https://www.proteomicsdb.org). The "picked" protein FDR approach treats target and decoy sequences of the same protein as a pair rather than as individual entities and chooses either the target or the decoy sequence depending on which receives the highest score. We investigated the performance of this approach in combination with q-value based peptide scoring to normalize sample-, instrument-, and search engine-specific differences. The "picked" target-decoy strategy performed best when protein scoring was based on the best peptide q-value for each protein yielding a stable number of true positive protein identifications over a wide range of q-value thresholds. We show that this simple and unbiased strategy eliminates a conceptual issue in the commonly used "classic" protein FDR approach that causes overprediction of false-positive protein identification in large data sets. The approach scales from small to very large data sets without losing performance, consistently increases the number of true-positive protein identifications and is readily implemented in proteomics analysis software.

Collapse

Alves G, Yu YK. Mass spectrometry-based protein identification with accurate statistical significance assignment. ACTA ACUST UNITED AC 2014;31:699-706. [PMID: 25362092 DOI: 10.1093/bioinformatics/btu717] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Serang O. The probabilistic convolution tree: efficient exact Bayesian inference for faster LC-MS/MS protein inference. PLoS One 2014;9:e91507. [PMID: 24626234 PMCID: PMC3953406 DOI: 10.1371/journal.pone.0091507] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Accepted: 02/12/2014] [Indexed: 11/18/2022] Open

Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Ramon J, Laukens K, Valkenborg D, Barsnes H, Martens L. Machine learning applications in proteomics research: how the past can boost the future. Proteomics 2014;14:353-66. [PMID: 24323524 DOI: 10.1002/pmic.201300289] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Revised: 09/24/2013] [Accepted: 10/14/2013] [Indexed: 01/22/2023]

Drift time-specific collision energies enable deep-coverage data-independent acquisition proteomics. Nat Methods 2013;11:167-70. [DOI: 10.1038/nmeth.2767] [Citation(s) in RCA: 324] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2013] [Accepted: 11/05/2013] [Indexed: 12/30/2022]

Yang C, He Z, Yu W. A combinatorial perspective of the protein inference problem. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013;10:1542-1547. [PMID: 24407311 DOI: 10.1109/tcbb.2013.110] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Serang O, Cansizoglu AE, Käll L, Steen H, Steen JA. Nonparametric Bayesian evaluation of differential protein quantification. J Proteome Res 2013;12:4556-65. [PMID: 24024742 DOI: 10.1021/pr400678m] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

McDowell GS, Philpott A. Non-canonical ubiquitylation: mechanisms and consequences. Int J Biochem Cell Biol 2013;45:1833-42. [PMID: 23732108 DOI: 10.1016/j.biocel.2013.05.026] [Citation(s) in RCA: 113] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 05/10/2013] [Accepted: 05/22/2013] [Indexed: 01/04/2023]

Serang O, Paulo J, Steen H, Steen JA. A non-parametric cutout index for robust evaluation of identified proteins. Mol Cell Proteomics 2013;12:807-12. [PMID: 23292186 DOI: 10.1074/mcp.o112.022863] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open

Serang O, Moruz L, Hoopmann MR, Käll L. Recognizing uncertainty increases robustness and reproducibility of mass spectrometry-based protein inferences. J Proteome Res 2012;11:5586-91. [PMID: 23148905 DOI: 10.1021/pr300426s] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Li YF, Radivojac P. Computational approaches to protein inference in shotgun proteomics. BMC Bioinformatics 2012;13 Suppl 16:S4. [PMID: 23176300 PMCID: PMC3489551 DOI: 10.1186/1471-2105-13-s16-s4] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open