Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Elias JE, Gibbons FD, King OD, Roth FP, Gygi SP. Intensity-based protein identification by machine learning from a library of tandem mass spectra. Nat Biotechnol 2004;22:214-9. [PMID: 14730315 DOI: 10.1038/nbt930] [Citation(s) in RCA: 247] [Impact Index Per Article: 12.4] [Received: 08/11/2003] [Accepted: 10/31/2003] [Indexed: 11/09/2022]

Cited by Other Article(s)

Adams C, Laukens K, Bittremieux W, Boonen K. Machine learning-based peptide-spectrum match rescoring opens up the immunopeptidome. Proteomics 2024;24:e2300336. [PMID: 38009585 DOI: 10.1002/pmic.202300336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 10/18/2023] [Accepted: 10/23/2023] [Indexed: 11/29/2023]

Skiadopoulou D, Vašíček J, Kuznetsova K, Bouyssié D, Käll L, Vaudel M. Retention Time and Fragmentation Predictors Increase Confidence in Identification of Common Variant Peptides. J Proteome Res 2023;22:3190-3199. [PMID: 37656829 PMCID: PMC10563157 DOI: 10.1021/acs.jproteome.3c00243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/24/2023] [Indexed: 09/03/2023]

Yang KL, Yu F, Teo GC, Li K, Demichev V, Ralser M, Nesvizhskii AI. MSBooster: improving peptide identification rates using deep learning-based features. Nat Commun 2023;14:4539. [PMID: 37500632 PMCID: PMC10374903 DOI: 10.1038/s41467-023-40129-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 11/07/2022] [Accepted: 07/06/2023] [Indexed: 07/29/2023] Open

Geer LY, Lapin J, Slotta DJ, Mak TD, Stein SE. AIomics: Exploring More of the Proteome Using Mass Spectral Libraries Extended by Artificial Intelligence. J Proteome Res 2023;22:2246-2255. [PMID: 37232537 PMCID: PMC10542943 DOI: 10.1021/acs.jproteome.2c00807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/27/2023]

Qin J, Guo J, Tang G, Li L, Yao SQ. Multiplex Identification of Post-Translational Modifications at Point-of-Care by Deep Learning-Assisted Hydrogel Sensors. Angew Chem Int Ed Engl 2023;62:e202218412. [PMID: 36815677 DOI: 10.1002/anie.202218412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/13/2022] [Revised: 02/02/2023] [Accepted: 02/23/2023] [Indexed: 02/24/2023]

Wińska P, Sobiepanek A, Pawlak K, Staniszewska M, Cieśla J. Phosphorylation of Thymidylate Synthase and Dihydrofolate Reductase in Cancer Cells and the Effect of CK2α Silencing. Int J Mol Sci 2023;24:ijms24033023. [PMID: 36769342 PMCID: PMC9917831 DOI: 10.3390/ijms24033023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/04/2023] [Revised: 01/30/2023] [Accepted: 02/01/2023] [Indexed: 02/08/2023] Open

Searle BC, Shannon AE, Wilburn DB. Scribe: Next Generation Library Searching for DDA Experiments. J Proteome Res 2023;22:482-490. [PMID: 36695531 DOI: 10.1021/acs.jproteome.2c00672] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Indexed: 01/26/2023]

Cox J. Prediction of peptide mass spectral libraries with machine learning. Nat Biotechnol 2023;41:33-43. [PMID: 36008611 DOI: 10.1038/s41587-022-01424-w] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Received: 03/01/2022] [Accepted: 07/11/2022] [Indexed: 01/21/2023]

McDonnell K, Howley E, Abram F. Critical evaluation of the use of artificial data for machine learning based de novo peptide identification. Comput Struct Biotechnol J 2023;21:2732-2743. [PMID: 37168871 PMCID: PMC10165132 DOI: 10.1016/j.csbj.2023.04.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2023] [Revised: 04/16/2023] [Accepted: 04/16/2023] [Indexed: 05/13/2023] Open

Cormican JA, Horokhovskyi Y, Soh WT, Mishto M, Liepe J. inSPIRE: An Open-Source Tool for Increased Mass Spectrometry Identification Rates Using Prosit Spectral Prediction. Mol Cell Proteomics 2022;21:100432. [PMID: 36280141 PMCID: PMC9720494 DOI: 10.1016/j.mcpro.2022.100432] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 05/26/2022] [Revised: 10/17/2022] [Accepted: 10/19/2022] [Indexed: 11/05/2022] Open

Bojar D, Lisacek F. Glycoinformatics in the Artificial Intelligence Era. Chem Rev 2022;122:15971-15988. [PMID: 35961636 PMCID: PMC9615983 DOI: 10.1021/acs.chemrev.2c00110] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 02/10/2022] [Indexed: 11/29/2022]

Na S, Choi H, Paek E. Deephos: Predicted spectral database search for TMT-labeled phosphopeptides and its false discovery rate estimation. Bioinformatics 2022;38:2980-2987. [PMID: 35441674 DOI: 10.1093/bioinformatics/btac280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2021] [Revised: 03/26/2022] [Accepted: 04/14/2022] [Indexed: 11/14/2022] Open

Liu Y, Wang H, Gui S, Zeng B, Pu J, Zheng P, Zeng L, Luo Y, Wu Y, Zhou C, Song J, Ji P, Wei H, Xie P. Proteomics analysis of the gut-brain axis in a gut microbiota-dysbiosis model of depression. Transl Psychiatry 2021;11:568. [PMID: 34744165 PMCID: PMC8572885 DOI: 10.1038/s41398-021-01689-w] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Received: 07/27/2021] [Revised: 10/17/2021] [Accepted: 10/20/2021] [Indexed: 12/21/2022] Open

Affiliation(s)

Yiyun Liu NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Haiyang Wang NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Siwen Gui NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Benhua Zeng Department of Laboratory Animal Science, College of Basic Medical Sciences, Third Military Medical University, Chongqing, China (grid.410570.7; 0000 0004 1760 6682)
Juncai Pu NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Peng Zheng NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Li Zeng NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Yuanyuan Luo NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
You Wu NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Chanjuan Zhou NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China (grid.452206.7; 0000 0004 1758 417X)
Jinlin Song College of Stomatology, Chongqing Medical University, Chongqing, China (grid.203458.8; 0000 0000 8653 0555)
Ping Ji College of Stomatology, Chongqing Medical University, Chongqing, China (grid.203458.8; 0000 0000 8653 0555)
Hong Wei Department of Laboratory Animal Science, College of Basic Medical Sciences, Third Military Medical University, Chongqing, China.
Peng Xie NHC Key Laboratory of Diagnosis and Treatment on Brain Functional Diseases, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China.

Britt HM, Cragnolini T, Thalassinos K. Integration of Mass Spectrometry Data for Structural Biology. Chem Rev 2021;122:7952-7986. [PMID: 34506113 DOI: 10.1021/acs.chemrev.1c00356] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Indexed: 01/05/2023]

Abstract

Mass spectrometry (MS) is increasingly being used to probe the structure and dynamics of proteins and the complexes they form with other macromolecules. There are now several specialized MS methods, each with unique sample preparation, data acquisition, and data processing protocols. Collectively, these methods are referred to as structural MS and include cross-linking, hydrogen-deuterium exchange, hydroxyl radical footprinting, native, ion mobility, and top-down MS. Each of these provides a unique type of structural information, ranging from composition and stoichiometry through to residue level proximity and solvent accessibility. Structural MS has proved particularly beneficial in studying protein classes for which analysis by classic structural biology techniques proves challenging such as glycosylated or intrinsically disordered proteins. To capture the structural details for a particular system, especially larger multiprotein complexes, more than one structural MS method with other structural and biophysical techniques is often required. Key to integrating these diverse data are computational strategies and software solutions to facilitate this process. We provide a background to the structural MS methods and briefly summarize other structural methods and how these are combined with MS. We then describe current state of the art approaches for the integration of structural MS data for structural biology. We quantify how often these methods are used together and provide examples where such combinations have been fruitful. To illustrate the power of integrative approaches, we discuss progress in solving the structures of the proteasome and the nuclear pore complex. We also discuss how information from structural MS, particularly pertaining to protein dynamics, is not currently utilized in integrative workflows and how such information can provide a more accurate picture of the systems studied. We conclude by discussing new developments in the MS and computational fields that will further enable in-cell structural studies.

Mann M, Kumar C, Zeng WF, Strauss MT. Artificial intelligence for proteomics and biomarker discovery. Cell Syst 2021;12:759-770. [PMID: 34411543 DOI: 10.1016/j.cels.2021.06.006] [Citation(s) in RCA: 71] [Impact Index Per Article: 23.7] [Received: 03/04/2021] [Revised: 05/07/2021] [Accepted: 06/28/2021] [Indexed: 12/14/2022]

Feng S, Sterzenbach R, Guo X. Deep learning for peptide identification from metaproteomics datasets. J Proteomics 2021;247:104316. [PMID: 34246788 DOI: 10.1016/j.jprot.2021.104316] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 03/14/2021] [Revised: 06/02/2021] [Accepted: 06/18/2021] [Indexed: 10/20/2022]

Taking the leap between analytical chemistry and artificial intelligence: A tutorial review. Anal Chim Acta 2021;1161:338403. [DOI: 10.1016/j.aca.2021.338403] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 12/04/2020] [Revised: 03/02/2021] [Accepted: 03/03/2021] [Indexed: 01/01/2023]

Guan S, Bythell BJ. Size Dependent Fragmentation Chemistry of Short Doubly Protonated Tryptic Peptides. J Am Soc Mass Spectrom 2021;32:1020-1032. [PMID: 33779179 DOI: 10.1021/jasms.1c00009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/12/2023]

Łącki MK, Startek MP, Brehmer S, Distler U, Tenzer S. OpenTIMS, TimsPy, and TimsR: Open and Easy Access to timsTOF Raw Data. J Proteome Res 2021;20:2122-2129. [PMID: 33724840 DOI: 10.1021/acs.jproteome.0c00962] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Indexed: 12/18/2022]

Hwang H, Szucs MJ, Ding LJ, Allen A, Ren X, Haensgen H, Gao F, Rhim H, Andrade A, Pan JQ, Carr SA, Ahmad R, Xu W. Neurogranin, Encoded by the Schizophrenia Risk Gene NRGN, Bidirectionally Modulates Synaptic Plasticity via Calmodulin-Dependent Regulation of the Neuronal Phosphoproteome. Biol Psychiatry 2021;89:256-269. [PMID: 33032807 PMCID: PMC9258036 DOI: 10.1016/j.biopsych.2020.07.014] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 01/14/2020] [Revised: 07/22/2020] [Accepted: 07/22/2020] [Indexed: 12/22/2022]

Abstract

BACKGROUND

Neurogranin (Ng), encoded by the schizophrenia risk gene NRGN, is a calmodulin-binding protein enriched in the postsynaptic compartments, and its expression is reduced in the postmortem brains of patients with schizophrenia. Experience-dependent translation of Ng is critical for encoding contextual memory, and Ng regulates developmental plasticity in the primary visual cortex during the critical period. However, the overall impact of Ng on the neuronal signaling that regulates synaptic plasticity is unknown.

METHODS

Altered Ng expression was achieved via virus-mediated gene manipulation in mice. The effect on long-term potentiation (LTP) was assessed using spike timing-dependent plasticity protocols. Quantitative phosphoproteomics analyses led to discoveries in significant phosphorylated targets. An identified candidate was examined with high-throughput planar patch clamp and was validated with pharmacological manipulation.

RESULTS

Ng bidirectionally modulated LTP in the hippocampus. Decreasing Ng levels significantly affected the phosphorylation pattern of postsynaptic density proteins, including glutamate receptors, GTPases, kinases, RNA binding proteins, selective ion channels, and ionic transporters, some of which highlighted clusters of schizophrenia- and autism-related genes. Hypophosphorylation of NMDA receptor subunit Grin2A, one significant phosphorylated target, resulted in accelerated decay of NMDA receptor currents. Blocking protein phosphatase PP2B activity rescued the accelerated NMDA receptor current decay and the impairment of LTP mediated by Ng knockdown, implicating the requirement of synaptic PP2B activity for the deficits.

CONCLUSIONS

Altered Ng levels affect the phosphorylation landscape of neuronal proteins. PP2B activity is required for mediating the deficit in synaptic plasticity caused by decreasing Ng levels, revealing a novel mechanistic link of a schizophrenia risk gene to cognitive deficits.

Affiliation(s)

Hongik Hwang Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts; Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts; Center for Neuroscience, Brain Science Institute, Korea Institute of Science and Technology, Seoul, Republic of Korea.
Matthew J. Szucs Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
Lei J. Ding Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Andrew Allen Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
Xiaobai Ren Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Henny Haensgen Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Fan Gao Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Hyewhon Rhim Center for Neuroscience, Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea; Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology, Seoul 02792, Republic of Korea
Arturo Andrade Department of Biological Sciences, University of New Hampshire, Durham, NH 03824, USA
Jen Q. Pan Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
Steven A. Carr Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
Rushdy Ahmad Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
Weifeng Xu Picower Institute for Learning and Memory, Massachusetts Institute of Technology, Cambridge, Massachusetts; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts.

Wei Y, Varanasi RS, Schwarz T, Gomell L, Zhao H, Larson DJ, Sun B, Liu G, Chen H, Raabe D, Gault B. Machine-learning-enhanced time-of-flight mass spectrometry analysis. Patterns 2021;2:100192. [PMID: 33659909 PMCID: PMC7892357 DOI: 10.1016/j.patter.2020.100192] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 09/29/2020] [Revised: 11/13/2020] [Accepted: 12/17/2020] [Indexed: 01/06/2023]

Abstract

Mass spectrometry is a widespread approach used to work out what the constituents of a material are. Atoms and molecules are removed from the material and collected, and subsequently, a critical step is to infer their correct identities based on patterns formed in their mass-to-charge ratios and relative isotopic abundances. However, this identification step still mainly relies on individual users' expertise, making its standardization challenging, and hindering efficient data processing. Here, we introduce an approach that leverages a modern machine learning technique to identify peak patterns in time-of-flight mass spectra within microseconds, outperforming human users without loss of accuracy. Our approach is cross-validated on mass spectra generated from different time-of-flight mass spectrometry (ToF-MS) techniques, offering the ToF-MS community an open-source, intelligent mass spectra analysis.

• A machine-learning method provides reliable atomic/molecular labels for ToF-MS
• No human labeling or prior information required
• The training dataset is artificially generated based on isotopic abundances
• Method validated on a variety of materials and two ToF-MS-based techniques

Time-of-flight mass spectrometry (ToF-MS) is a mainstream analytical technique widely used in biology, chemistry, and materials science. ToF-MS provides quantitative compositional analysis with high sensitivity across a wide dynamic range of mass-to-charge ratios. A critical step in ToF-MS is to infer the identity of the detected ions. Here, we introduce a machine-learning-enhanced algorithm to provide a user-independent approach to performing this identification using patterns from the natural isotopic abundances of individual atomic and molecular ions, without human labeling or prior knowledge of composition. Results from several materials and techniques are compared with those obtained by field experts. Our open-source, easy-to-implement, reliable analytic method accelerates this identification process. A wide range of ToF-MS-based applications can benefit from our approach, e.g., hunting for patterns of biomarkers or for contamination on solid surfaces in high-throughput data.

Ye Z, Vakhrushev SY. The Role of Data-Independent Acquisition for Glycoproteomics. Mol Cell Proteomics 2021;20:100042. [PMID: 33372048 PMCID: PMC8724878 DOI: 10.1074/mcp.r120.002204] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Received: 06/30/2020] [Revised: 12/26/2020] [Accepted: 12/28/2020] [Indexed: 12/13/2022] Open

Hendrickx JO, van Gastel J, Leysen H, Martin B, Maudsley S. High-dimensionality Data Analysis of Pharmacological Systems Associated with Complex Diseases. Pharmacol Rev 2020;72:191-217. [PMID: 31843941 DOI: 10.1124/pr.119.017921] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Indexed: 12/11/2022] Open

Abstract

It is widely accepted that molecular reductionist views of highly complex human physiologic activity, e.g., the aging process, as well as therapeutic drug efficacy are largely oversimplifications. Currently some of the most effective appreciation of biologic disease and drug response complexity is achieved using high-dimensionality (H-D) data streams from transcriptomic, proteomic, metabolomics, or epigenomic pipelines. Multiple H-D data sets are now common and freely accessible for complex diseases such as metabolic syndrome, cardiovascular disease, and neurodegenerative conditions such as Alzheimer's disease. Over the last decade our ability to interrogate these high-dimensionality data streams has been profoundly enhanced through the development and implementation of highly effective bioinformatic platforms. Employing these computational approaches to understand the complexity of age-related diseases provides a facile mechanism to then synergize this pathologic appreciation with a similar level of understanding of therapeutic-mediated signaling. For informative pathology and drug-based analytics that are able to generate meaningful therapeutic insight across diverse data streams, novel informatics processes such as latent semantic indexing and topological data analyses will likely be important. Elucidation of H-D molecular disease signatures from diverse data streams will likely generate and refine new therapeutic strategies that will be designed with a cognizance of a realistic appreciation of the complexity of human age-related disease and drug effects. We contend that informatic platforms should be synergistic with more advanced chemical/drug and phenotypic cellular/tissue-based analytical predictive models to assist in either de novo drug prioritization or effective repurposing for the intervention of aging-related diseases.

SIGNIFICANCE STATEMENT

All diseases, as well as pharmacological mechanisms, are far more complex than previously thought a decade ago. With the advent of commonplace access to technologies that produce large volumes of high-dimensionality data (e.g., transcriptomics, proteomics, metabolomics), it is now imperative that effective tools to appreciate this highly nuanced data are developed. Being able to appreciate the subtleties of high-dimensionality data will allow molecular pharmacologists to develop the most effective multidimensional therapeutics with effectively engineered efficacy profiles.

Xu R, Sheng J, Bai M, Shu K, Zhu Y, Chang C. A Comprehensive Evaluation of MS/MS Spectrum Prediction Tools for Shotgun Proteomics. Proteomics 2020;20:e1900345. [DOI: 10.1002/pmic.201900345] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 12/31/2019] [Revised: 04/29/2020] [Indexed: 01/27/2023]

Bouwmeester R, Gabriels R, Van Den Bossche T, Martens L, Degroeve S. The Age of Data-Driven Proteomics: How Machine Learning Enables Novel Workflows. Proteomics 2020;20:e1900351. [PMID: 32267083 DOI: 10.1002/pmic.201900351] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 02/26/2020] [Revised: 03/21/2020] [Indexed: 12/30/2022]

Cerqueira FR, Vasconcelos ATR. OCCAM: prediction of small ORFs in bacterial genomes by means of a target-decoy database approach and machine learning techniques. Database (Oxford) 2020;2020:5989499. [PMID: 33206960 PMCID: PMC7673341 DOI: 10.1093/database/baaa067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/26/2020] [Revised: 07/11/2020] [Accepted: 07/27/2020] [Indexed: 11/14/2022]

Röst HL. Deep learning adds an extra dimension to peptide fragmentation. Nat Methods 2019;16:469-470. [PMID: 31147636 DOI: 10.1038/s41592-019-0428-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Indexed: 01/08/2023]

Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat Methods 2019;16:509-518. [DOI: 10.1038/s41592-019-0426-7] [Citation(s) in RCA: 340] [Impact Index Per Article: 68.0] [Received: 08/20/2018] [Accepted: 04/18/2019] [Indexed: 11/08/2022]

Kirik U, Refsgaard JC, Jensen LJ. Improving Peptide-Spectrum Matching by Fragmentation Prediction Using Hidden Markov Models. J Proteome Res 2019;18:2385-2396. [PMID: 31074280 DOI: 10.1021/acs.jproteome.8b00499] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/28/2022]

Solovyeva EM, Kopysov VN, Pereverzev AY, Lobas AA, Moshkovskii SA, Gorshkov MV, Boyarkin OV. Method for Identification of Threonine Isoforms in Peptides by Ultraviolet Photofragmentation of Cold Ions. Anal Chem 2019;91:6709-6715. [DOI: 10.1021/acs.analchem.9b00770] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Indexed: 11/29/2022]

Muth T, Renard BY. Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification? Brief Bioinform 2019;19:954-970. [PMID: 28369237 DOI: 10.1093/bib/bbx033] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Received: 12/08/2016] [Indexed: 01/24/2023] Open

Abstract

While peptide identifications in mass spectrometry (MS)-based shotgun proteomics are mostly obtained using database search methods, high-resolution spectrum data from modern MS instruments nowadays offer the prospect of improving the performance of computational de novo peptide sequencing. The major benefit of de novo sequencing is that it does not require a reference database to deduce full-length or partial tag-based peptide sequences directly from experimental tandem mass spectrometry spectra. Although various algorithms have been developed for automated de novo sequencing, the prediction accuracy of proposed solutions has been rarely evaluated in independent benchmarking studies. The main objective of this work is to provide a detailed evaluation on the performance of de novo sequencing algorithms on high-resolution data. For this purpose, we processed four experimental data sets acquired from different instrument types from collision-induced dissociation and higher energy collisional dissociation (HCD) fragmentation mode using the software packages Novor, PEAKS and PepNovo. Moreover, the accuracy of these algorithms is also tested on ground truth data based on simulated spectra generated from peak intensity prediction software. We found that Novor shows the overall best performance compared with PEAKS and PepNovo with respect to the accuracy of correct full peptide, tag-based and single-residue predictions. In addition, the same tool outpaced the commercial competitor PEAKS in terms of running time speedup by factors of around 12-17. Despite around 35% prediction accuracy for complete peptide sequences on HCD data sets, taken as a whole, the evaluated algorithms perform moderately on experimental data but show a significantly better performance on simulated data (up to 84% accuracy). Further, we describe the most frequently occurring de novo sequencing errors and evaluate the influence of missing fragment ion peaks and spectral noise on the accuracy. Finally, we discuss the potential of de novo sequencing for now becoming more widely used in the field.

Hutchins PD, Russell JD, Coon JJ. Mapping Lipid Fragmentation for Tailored Mass Spectral Libraries. J Am Soc Mass Spectrom 2019;30:659-668. [PMID: 30756325 PMCID: PMC6447430 DOI: 10.1007/s13361-018-02125-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 11/28/2018] [Revised: 12/17/2018] [Accepted: 12/17/2018] [Indexed: 05/17/2023]

Maudsley S, Devanarayan V, Martin B, Geerts H. Intelligent and effective informatic deconvolution of “Big Data” and its future impact on the quantitative nature of neurodegenerative disease therapy. Alzheimers Dement 2018;14:961-975. [DOI: 10.1016/j.jalz.2018.01.014] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Received: 05/19/2017] [Revised: 10/03/2017] [Accepted: 01/18/2018] [Indexed: 12/31/2022]

Ciach MA, Łącki MK, Miasojedow B, Lermyte F, Valkenborg D, Sobott F, Gambin A. Estimation of Rates of Reactions Triggered by Electron Transfer in Top-Down Mass Spectrometry. J Comput Biol 2017;25:282-301. [PMID: 28945460 DOI: 10.1089/cmb.2017.0156] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 01/08/2023] Open

Shao W, Lam H. Tandem mass spectral libraries of peptides and their roles in proteomics research. Mass Spectrom Rev 2017;36:634-648. [PMID: 27403644 DOI: 10.1002/mas.21512] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Received: 11/30/2015] [Accepted: 05/21/2016] [Indexed: 05/15/2023]

Tschager T, Rösch S, Gillet L, Widmayer P. A better scoring model for de novo peptide sequencing: the symmetric difference between explained and measured masses. Algorithms Mol Biol 2017;12:12. [PMID: 28603547 PMCID: PMC5464308 DOI: 10.1186/s13015-017-0104-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 12/22/2016] [Accepted: 04/19/2017] [Indexed: 11/10/2022] Open

Cerqueira FR, Ricardo AM, de Paiva Oliveira A, Graber A, Baumgartner C. MUMAL2: Improving sensitivity in shotgun proteomics using cost sensitive artificial neural networks and a threshold selector algorithm. BMC Bioinformatics 2016;17:472. [PMID: 28105913 PMCID: PMC5249030 DOI: 10.1186/s12859-016-1341-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

This work presents a machine learning strategy to increase sensitivity in tandem mass spectrometry (MS/MS) data analysis for peptide/protein identification. MS/MS yields thousands of spectra in a single run which are then interpreted by software. Most of these computer programs use a protein database to match peptide sequences to the observed spectra. The peptide-spectrum matches (PSMs) must also be assessed by computational tools since manual evaluation is not practicable. The target-decoy database strategy is largely used for error estimation in PSM assessment. However, in general, that strategy does not account for sensitivity.

RESULTS

In a previous study, we proposed the method MUMAL that applies an artificial neural network to effectively generate a model to classify PSMs using decoy hits with increased sensitivity. Nevertheless, the present approach shows that the sensitivity can be further improved with the use of a cost matrix associated with the learning algorithm. We also demonstrate that using a threshold selector algorithm for probability adjustment leads to more coherent probability values assigned to the PSMs. Our new approach, termed MUMAL2, provides a two-fold contribution to shotgun proteomics. First, the increase in the number of correctly interpreted spectra in the peptide level augments the chance of identifying more proteins. Second, the more appropriate PSM probability values that are produced by the threshold selector algorithm impact the protein inference stage performed by programs that take probabilities into account, such as ProteinProphet. Our experiments demonstrate that MUMAL2 reached around 15% of improvement in sensitivity compared to the best current method. Furthermore, the area under the ROC curve obtained was 0.93, demonstrating that the probabilities generated by our model are in fact appropriate. Finally, Venn diagrams comparing MUMAL2 with the best current method show that the number of exclusive peptides found by our method was nearly 4-fold higher, which directly impacts the proteome coverage.

CONCLUSIONS

The inclusion of a cost matrix and a probability threshold selector algorithm to the learning task further improves the target-decoy database analysis for identifying peptides, which optimally contributes to the challenging task of protein level identification, resulting in a powerful computational tool for shotgun proteomics.

Williams EG, Wu Y, Jha P, Dubuis S, Blattmann P, Argmann CA, Houten SM, Amariuta T, Wolski W, Zamboni N, Aebersold R, Auwerx J. Systems proteomics of liver mitochondria function. Science 2016;352:aad0189. [PMID: 27284200 PMCID: PMC10859670 DOI: 10.1126/science.aad0189] [Citation(s) in RCA: 208] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2015] [Accepted: 04/15/2016] [Indexed: 12/14/2022]

Affiliation(s)

Evan G Williams: Laboratory of Integrative and Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015, Switzerland. These authors contributed equally to this work.
Yibo Wu: Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Switzerland. These authors contributed equally to this work.
Pooja Jha: Laboratory of Integrative and Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015, Switzerland.
Sébastien Dubuis: Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Switzerland.
Peter Blattmann: Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Switzerland.
Carmen A Argmann: Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, Box 1498, New York, NY 10029, USA.
Sander M Houten: Department of Genetics and Genomic Sciences and Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, Box 1498, New York, NY 10029, USA.
Tiffany Amariuta: Laboratory of Integrative and Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015, Switzerland.
Witold Wolski: Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Switzerland.
Nicola Zamboni: Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Switzerland.
Ruedi Aebersold: Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, CH-8093, Switzerland. Faculty of Science, University of Zurich, CH-8057, Switzerland.
Johan Auwerx: Laboratory of Integrative and Systems Physiology, Interfaculty Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, CH-1015, Switzerland.

Du YM, Hu Y, Xia Y, Ouyang Z. Power Normalization for Mass Spectrometry Data Analysis and Analytical Method Assessment. Anal Chem 2016;88:3156-63. [PMID: 26882462 PMCID: PMC8135100 DOI: 10.1021/acs.analchem.5b04418] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Bischoff R, Permentier H, Guryev V, Horvatovich P. Genomic variability and protein species — Improving sequence coverage for proteogenomics. J Proteomics 2016;134:25-36. [DOI: 10.1016/j.jprot.2015.09.021] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2015] [Revised: 09/06/2015] [Accepted: 09/14/2015] [Indexed: 12/30/2022]

Lund RR, Leth-Larsen R, Caterino TD, Terp MG, Nissen J, Lænkholm AV, Jensen ON, Ditzel HJ. NADH-Cytochrome b5 Reductase 3 Promotes Colonization and Metastasis Formation and Is a Prognostic Marker of Disease-Free and Overall Survival in Estrogen Receptor-Negative Breast Cancer. Mol Cell Proteomics 2015;14:2988-99. [PMID: 26351264 DOI: 10.1074/mcp.m115.050385] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Indexed: 01/11/2023] Open

Abstract

Metastasis is the main cause of cancer-related deaths and remains the most significant challenge to management of the disease. Metastases are established through a complex multistep process involving intracellular signaling pathways. To gain insight to proteins central to specific steps in metastasis formation, we used a metastasis cell line model that allows investigation of extravasation and colonization of circulating cancer cells to lungs in mice. Using stable isotopic labeling by amino acids in cell culture and subcellular fractionation, the nuclear, cytosol, and mitochondria proteomes were analyzed by LC-MS/MS, identifying a number of proteins that exhibited altered expression in isogenic metastatic versus nonmetastatic cancer cell lines, including NADH-cytochrome b5 reductase 3 (CYB5R3), l-lactate dehydrogenase A (LDHA), Niemann-pick c1 protein (NPC1), and nucleolar RNA helicase 2 (NRH2). The altered expression levels were validated at the protein and transcriptional levels, and analysis of breast cancer biopsies from two cohorts of patients demonstrated a significant correlation between high CYB5R3 expression and poor disease-free and overall survival in patients with estrogen receptor-negative tumors (DFS: p = .02, OS: p = .04). CYB5R3 gene knock-down using siRNA in metastasizing cells led to significantly decreased tumor burden in lungs when injected intravenously in immunodeficient mice. The cellular effects of CYB5R3 knock-down showed signaling alterations associated with extravasation, TGFβ and HIFα pathways, and apoptosis. The decreased apoptosis of CYB5R3 knock-down metastatic cancer cell lines was confirmed in functional assays. Our study reveals a central role of CYB5R3 in extravasation/colonization of cancer cells and demonstrates the ability of our quantitative, comparative proteomic approach to identify key proteins of specific important biological processes that may also prove useful as potential biomarkers of clinical relevance. 
MS data are available via ProteomeXchange with identifier PXD001391.

Goto R, Nakamura Y, Takami T, Sanke T, Tozuka Z. Quantitative LC-MS/MS Analysis of Proteins Involved in Metastasis of Breast Cancer. PLoS One 2015;10:e0130760. [PMID: 26176947 PMCID: PMC4503764 DOI: 10.1371/journal.pone.0130760] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 05/22/2015] [Indexed: 12/29/2022] Open

Raulfs MDM, Breci L, Bernier M, Hamdy OM, Janiga A, Wysocki V, Poutsma JC. Investigations of the mechanism of the "proline effect" in tandem mass spectrometry experiments: the "pipecolic acid effect". JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2014;25:1705-1715. [PMID: 25078156 DOI: 10.1007/s13361-014-0953-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Revised: 06/12/2014] [Accepted: 06/14/2014] [Indexed: 06/03/2023]

Dong NP, Liang YZ, Xu QS, Mok DKW, Yi LZ, Lu HM, He M, Fan W. Prediction of Peptide Fragment Ion Mass Spectra by Data Mining Techniques. Anal Chem 2014;86:7446-54. [DOI: 10.1021/ac501094m] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Pathway and network analysis in proteomics. J Theor Biol 2014;362:44-52. [PMID: 24911777 DOI: 10.1016/j.jtbi.2014.05.031] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2014] [Revised: 05/15/2014] [Accepted: 05/21/2014] [Indexed: 12/14/2022]

Smith R, Mathis AD, Ventura D, Prince JT. Proteomics, lipidomics, metabolomics: a mass spectrometry tutorial from a computer scientist's point of view. BMC Bioinformatics 2014;15 Suppl 7:S9. [PMID: 25078324 PMCID: PMC4110734 DOI: 10.1186/1471-2105-15-s7-s9] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Liang SY, Wu SW, Pu TH, Chang FY, Khoo KH. An adaptive workflow coupled with Random Forest algorithm to identify intact N-glycopeptides detected from mass spectrometry. Bioinformatics 2014;30:1908-16. [DOI: 10.1093/bioinformatics/btu139] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Ramon J, Laukens K, Valkenborg D, Barsnes H, Martens L. Machine learning applications in proteomics research: how the past can boost the future. Proteomics 2014;14:353-66. [PMID: 24323524 DOI: 10.1002/pmic.201300289] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Revised: 09/24/2013] [Accepted: 10/14/2013] [Indexed: 01/22/2023]

Meyer JG, Kim S, Maltby DA, Ghassemian M, Bandeira N, Komives EA. Expanding proteome coverage with orthogonal-specificity α-lytic proteases. Mol Cell Proteomics 2014;13:823-35. [PMID: 24425750 DOI: 10.1074/mcp.m113.034710] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Mass Spectrometry-Based Protein Sequencing Platforms. TRANSLATIONAL BIOINFORMATICS 2014. [DOI: 10.1007/978-94-017-9202-8_5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]