1
|
Battellino T, Yeung D, Neustaeter H, Spicer V, Ogata K, Ishihama Y, Krokhin OV. Retention time prediction for post-translationally modified peptides: Ser, Thr, Tyr-phosphorylation. J Chromatogr A 2024; 1718:464714. [PMID: 38359688 DOI: 10.1016/j.chroma.2024.464714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/22/2024] [Accepted: 02/01/2024] [Indexed: 02/17/2024]
Abstract
The development of a peptide retention prediction model for reversed-phase chromatography applications in proteomics is reported for peptides carrying phosphorylated Ser, Thr and Tyr-residues. The major retention features have been assessed using a collection of over 10,000 phosphorylated/non-phosphorylated peptide pairs identified in a series 1D and 2D LC-MS/MS acquisitions using formic acid as ion pairing modifier. Single modification event on average results in increased peptide retention for phosphorylation of Ser (+ 1.46), Thr (+1.33), Tyr (+0.93% acetonitrile, ACN) on gradient elution scale for Luna C18(2) stationary phase. We established several composition and sequence specific features, which drive deviations from these average values. Thus, single phosphorylation of serine results in retention shifts ranging from -2.4 to 5.5% ACN depending on position of the residue, nature of nearest neighbour residues, peptide length, hydrophobicity and pI value, and its propensity to form amphipathic helical structures. We established that the altered ion-pairing environment upon phosphorylation is detrimental for this variability. Hydrophobicity of ion-pairing modifier directly informs the magnitude of expected shifts: (most hydrophilic) 0.5 % acetic acid (larger positive shift upon phosphorylation) > 0.1 % formic acid (positive) > 0.1 % trifluoroacetic (negative) > 0.1 % heptafluorobutyric acid (larger negative shift). The effect of phosphorylation has been also evaluated for several separation conditions used in the first dimension of 2D LC applications: high pH reversed-phase (RP), hydrophilic interaction liquid chromatography (HILIC), strong cation- and strong anion exchange separations.
Collapse
Affiliation(s)
- Taylor Battellino
- Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada
| | - Darien Yeung
- Department of Biochemistry and Medical Genetics, University of Manitoba, 336 BMSB, 745 Bannatyne Avenue, Winnipeg, R3E 0J9, Canada
| | - Haley Neustaeter
- Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada
| | - Vic Spicer
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada
| | - Kosuke Ogata
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Yasushi Ishihama
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Oleg V Krokhin
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Internal Medicine, University of Manitoba, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada.
| |
Collapse
|
2
|
Hollebrands B, Hageman JA, van de Sande JW, Albada B, Janssen HG. Improved LC-MS identification of short homologous peptides using sequence-specific retention time predictors. Anal Bioanal Chem 2023; 415:2715-2726. [PMID: 37000211 PMCID: PMC10185643 DOI: 10.1007/s00216-023-04670-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 03/17/2023] [Accepted: 03/21/2023] [Indexed: 04/01/2023]
Abstract
Peptides are an important group of compounds contributing to the desired, as well as the undesired taste of a food product. Their taste impressions can include aspects of sweetness, bitterness, savoury, umami and many other impressions depending on the amino acids present as well as their sequence. Identification of short peptides in foods is challenging. We developed a method to assign identities to short peptides including homologous structures, i.e. peptides containing the same amino acids with a different sequence order, by accurate prediction of the retention times during reversed phase separation. To train the method, a large set of well-defined short peptides with systematic variations in the amino acid sequence was prepared by a novel synthesis strategy called 'swapped-sequence synthesis'. Additionally, several proteins were enzymatically digested to yield short peptides. Experimental retention times were determined after reversed phase separation and peptide MS2 data was acquired using a high-resolution mass spectrometer operated in data-dependent acquisition mode (DDA). A support vector regression model was trained using a combination of existing sequence-independent peptide descriptors and a newly derived set of selected amino acid index derived sequence-specific peptide (ASP) descriptors. The model was trained and validated using the experimental retention times of the 713 small food-relevant peptides prepared. Whilst selecting the most useful ASP descriptors for our model, special attention was given to predict the retention time differences between homologous peptide structures. Inclusion of ASP descriptors greatly improved the ability to accurately predict retention times, including retention time differences between 157 homologous peptide pairs. The final prediction model had a goodness-of-fit (Q2) of 0.94; moreover for 93% of the short peptides, the elution order was correctly predicted.
Collapse
Affiliation(s)
- Boudewijn Hollebrands
- Unilever Foods Innovation Centre - Hive, Bronland 14, 6708 WH, Wageningen, the Netherlands.
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands.
| | - Jos A Hageman
- Wageningen University & Research, Biometris, P.O. Box 16, 6700 AA, Wageningen, the Netherlands
| | - Jasper W van de Sande
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands
| | - Bauke Albada
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands
| | - Hans-Gerd Janssen
- Unilever Foods Innovation Centre - Hive, Bronland 14, 6708 WH, Wageningen, the Netherlands
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands
| |
Collapse
|
3
|
Yu H, Tai Q, Yang C, Gao M, Zhang X. Technological development of multidimensional liquid chromatography-mass spectrometry in proteome research. J Chromatogr A 2023; 1700:464048. [PMID: 37167805 DOI: 10.1016/j.chroma.2023.464048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 04/27/2023] [Accepted: 05/03/2023] [Indexed: 05/13/2023]
Abstract
Liquid chromatography-mass spectrometry (LC-MS) is the method of choice for high-throughput proteomic research. Limited by the peak capacity, the separation performance of conventional single-dimensional LC hampers the development of proteomics. Combining different separation modes orthogonally, multidimensional liquid chromatography (MDLC) with high peak capacity was developed to address this challenge. MDLC has evolved rapidly since its establishment, and the progress of proteomics has been greatly facilitated by the advent of novel MDLC-MS-based methods. In this paper, we will review the advances of MDLC-MS-based methodologies and technologies in proteomics studies, from different perspectives including novel application scenarios and proteomic targets, automation, miniaturization, and the improvement of the classic methods in recent years. In addition, attempts regarding new MDLC-MS models are also mentioned together with the outlook of MDLC-MS-based proteomics methods.
Collapse
Affiliation(s)
- Hailong Yu
- Department of Chemistry, Fudan University, 200438, China
| | - Qunfei Tai
- Department of Chemistry, Fudan University, 200438, China
| | - Chenjie Yang
- Department of Chemistry, Fudan University, 200438, China
| | - Mingxia Gao
- Department of Chemistry, Fudan University, 200438, China
| | - Xiangmin Zhang
- Department of Chemistry, Fudan University, 200438, China.
| |
Collapse
|
4
|
Chen M, Zhu P, Wan Q, Ruan X, Wu P, Hao Y, Zhang Z, Sun J, Nie W, Chen S. High-Coverage Four-Dimensional Data-Independent Acquisition Proteomics and Phosphoproteomics Enabled by Deep Learning-Driven Multidimensional Predictions. Anal Chem 2023; 95:7495-7502. [PMID: 37126374 DOI: 10.1021/acs.analchem.2c05414] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Four-dimensional (4D) data-independent acquisition (DIA)-based proteomics is a promising technology. However, its full performance is restricted by the time-consuming building and limited coverage of a project-specific experimental library. Herein, we developed a versatile multifunctional deep learning model Deep4D based on self-attention that could predict the collisional cross section, retention time, fragment ion intensity, and charge state with high accuracies for both the unmodified and phosphorylated peptides and thus established the complete workflows for high-coverage 4D DIA proteomics and phosphoproteomics based on multidimensional predictions. A 4D predicted library containing ∼2 million peptides was established that could realize experimental library-free DIA analysis, and 33% more proteins were identified than using an experimental library of single-shot measurement in the example of HeLa cells. These results show the great values of the convenient high-coverage 4D DIA proteomics methods.
Collapse
Affiliation(s)
- Moran Chen
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Pujia Zhu
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Qiongqiong Wan
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Xianqin Ruan
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Pengfei Wu
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Yanhong Hao
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Zhourui Zhang
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Jian Sun
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Wenjing Nie
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| | - Suming Chen
- The Institute for Advanced Studies, Wuhan University, Wuhan, Hubei 430072, China
| |
Collapse
|
5
|
Mizero B, Yeung D, Spicer V, Krokhin OV. Peptide retention time prediction for peptides with post-translational modifications: N-terminal (α-amine) and lysine (ε-amine) acetylation. J Chromatogr A 2021; 1657:462584. [PMID: 34619563 DOI: 10.1016/j.chroma.2021.462584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 09/23/2021] [Accepted: 09/24/2021] [Indexed: 10/20/2022]
Abstract
Development of a peptide retention prediction model in reversed-phase chromatography is reported for acetylated peptides - both N-terminal (α-) and side chain of Lys (ε-amine) residues. Large-scale proteomic 2D LC-MS analyses of acetylated/non-acetylated tryptic digest of whole human cell lysate have been used to assemble representative retention data sets of 25,000+ modified/non-modified pairs. This allowed elucidating chromatographic behaviour of modified peptides in three different separation modes: high pH reversed-phase, HILIC separation on amide phase (first dimension of 2D) and reversed-phase separation with formic acid as ion-pairing modifier in the second dimension. On average, N-terminal acetylation increases peptide RP retention at acidic pH by 5 Hydrophobicity Index units (% acetonitrile). Acetylation of first lysine adds another 4.1%. The magnitude of the retention shift varies greatly depending on the number of modified amines, peptide length, and N-terminal peptide sequence. Large retention shifts have been observed for peptides with hydrophobic N-termini and specifically peptides carrying sequences characteristic for amphipathic helical structures - all in complete agreement with major sequence-specific features of RP retention mechanism. The utility of the modified Sequence Specific Retention Calculator model has been verified for the in-vivo N-terminally acetylated peptides detected by 2D LC-MS/MS analysis of a yeast tryptic digest. The effect of N-terminal acetylation was also evaluated for six different HILIC columns, strong cation- and strong anion exchange separations using previously acquired 2D LC-MS/MS data.
Collapse
Affiliation(s)
- Benilde Mizero
- Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada
| | - Darien Yeung
- Department of Biochemistry and Medical Genetics, University of Manitoba, 336 BMSB, 745 Bannatyne Avenue, Winnipeg, R3E 0J9, Canada
| | - Vic Spicer
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada
| | - Oleg V Krokhin
- Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada; Department of Biochemistry and Medical Genetics, University of Manitoba, 336 BMSB, 745 Bannatyne Avenue, Winnipeg, R3E 0J9, Canada; Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Internal Medicine, University of Manitoba, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada.
| |
Collapse
|
6
|
Bouwmeester R, Gabriels R, Hulstaert N, Martens L, Degroeve S. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat Methods 2021; 18:1363-1369. [PMID: 34711972 DOI: 10.1038/s41592-021-01301-5] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 09/13/2021] [Indexed: 11/09/2022]
Abstract
The inclusion of peptide retention time prediction promises to remove peptide identification ambiguity in complex liquid chromatography-mass spectrometry identification workflows. However, due to the way peptides are encoded in current prediction models, accurate retention times cannot be predicted for modified peptides. This is especially problematic for fledgling open searches, which will benefit from accurate retention time prediction for modified peptides to reduce identification ambiguity. We present DeepLC, a deep learning peptide retention time predictor using peptide encoding based on atomic composition that allows the retention time of (previously unseen) modified peptides to be predicted accurately. We show that DeepLC performs similarly to current state-of-the-art approaches for unmodified peptides and, more importantly, accurately predicts retention times for modifications not seen during training. Moreover, we show that DeepLC's ability to predict retention times for any modification enables potentially incorrect identifications to be flagged in an open search of a wide variety of proteome data.
Collapse
Affiliation(s)
- Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Niels Hulstaert
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium. .,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| |
Collapse
|
7
|
Chang CH, Yeung D, Spicer V, Ogata K, Krokhin O, Ishihama Y. Sequence-Specific Model for Predicting Peptide Collision Cross Section Values in Proteomic Ion Mobility Spectrometry. J Proteome Res 2021; 20:3600-3610. [PMID: 34133192 DOI: 10.1021/acs.jproteome.1c00185] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The contribution of peptide amino acid sequence to collision cross section values (CCS) has been investigated using a dataset of ∼134 000 peptides of four different charge states (1+ to 4+). The migration data were acquired using a two-dimensional liquid chromatography (LC)/trapped ion mobility spectrometry/quadrupole/time-of-flight mass spectrometry (MS) analysis of HeLa cell digests created using seven different proteases and was converted to CCS values. Following the previously reported modeling approaches using intrinsic size parameters (ISP), we extended this methodology to encode the position of individual residues within a peptide sequence. A generalized prediction model was built by dividing the dataset into eight groups (four charges for both tryptic/nontryptic peptides). Position-dependent ISPs were independently optimized for the eight subsets of peptides, resulting in prediction accuracy of ∼0.981 for the entire population of peptides. We find that ion mobility is strongly affected by the peptide's ability to solvate the positively charged sites. Internal positioning of polar residues and proline leads to decreased CCS values as they improve charge solvation; conversely, this ability decreases with increasing peptide charge due to electrostatic repulsion. Furthermore, higher helical propensity and peptide hydrophobicity result in a preferential formation of extended structures with higher than predicted CCS values. Finally, acidic/basic residues exhibit position-dependent ISP behavior consistent with electrostatic interaction with the peptide macrodipole, which affects the peptide helicity. The MS raw data files have been deposited with the ProteomeXchange Consortium via the jPOST partner repository (http://jpostdb.org) with the dataset identifiers PXD021440/JPST000959, PXD022800/JPST001017, and PXD026087/ JPST001176.
Collapse
Affiliation(s)
- Chih-Hsiang Chang
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Darien Yeung
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
- Manitoba Centre for Proteomics and Systems Biology, University of Manitoba, Winnipeg, Manitoba R3E 3P4, Canada
- Department of Internal Medicine, University of Manitoba, Winnipeg, Manitoba R3E 3P4, Canada
| | - Victor Spicer
- Manitoba Centre for Proteomics and Systems Biology, University of Manitoba, Winnipeg, Manitoba R3E 3P4, Canada
| | - Kosuke Ogata
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Oleg Krokhin
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Manitoba R3E 0J9, Canada
- Manitoba Centre for Proteomics and Systems Biology, University of Manitoba, Winnipeg, Manitoba R3E 3P4, Canada
- Department of Internal Medicine, University of Manitoba, Winnipeg, Manitoba R3E 3P4, Canada
- Department of Chemistry, University of Manitoba, 360 Parker Building, Winnipeg, Manitoba R3T 2N2, Canada
| | - Yasushi Ishihama
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
- Laboratory of Clinical and Analytical Chemistry, National Institute of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka 567-0085, Japan
| |
Collapse
|
8
|
Giese SH, Sinn LR, Wegner F, Rappsilber J. Retention time prediction using neural networks increases identifications in crosslinking mass spectrometry. Nat Commun 2021; 12:3237. [PMID: 34050149 PMCID: PMC8163845 DOI: 10.1038/s41467-021-23441-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 04/26/2021] [Indexed: 12/13/2022] Open
Abstract
Crosslinking mass spectrometry has developed into a robust technique that is increasingly used to investigate the interactomes of organelles and cells. However, the incomplete and noisy information in the mass spectra of crosslinked peptides limits the numbers of protein-protein interactions that can be confidently identified. Here, we leverage chromatographic retention time information to aid the identification of crosslinked peptides from mass spectra. Our Siamese machine learning model xiRT achieves highly accurate retention time predictions of crosslinked peptides in a multi-dimensional separation of crosslinked E. coli lysate. Importantly, supplementing the search engine score with retention time features leads to a substantial increase in protein-protein interactions without affecting confidence. This approach is not limited to cell lysates and multi-dimensional separation but also improves considerably the analysis of crosslinked multiprotein complexes with a single chromatographic dimension. Retention times are a powerful complement to mass spectrometric information to increase the sensitivity of crosslinking mass spectrometry analyses.
Collapse
Affiliation(s)
- Sven H Giese
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany
- Data Analytics and Computational Statistics, Hasso Plattner Institute for Digital Engineering, Potsdam, Germany
- Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
| | - Ludwig R Sinn
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany
| | - Fritz Wegner
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany
| | - Juri Rappsilber
- Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany.
- Wellcome Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
9
|
de Jong L, Roseboom W, Kramer G. Towards low false discovery rate estimation for protein-protein interactions detected by chemical cross-linking. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2021; 1869:140655. [PMID: 33812047 DOI: 10.1016/j.bbapap.2021.140655] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 03/28/2021] [Accepted: 03/29/2021] [Indexed: 01/16/2023]
Abstract
Chemical cross-linking (CX) of proteins in vivo or in cell free extracts followed by mass spectrometric (MS) identification of linked peptide pairs (CXMS) can reveal protein-protein interactions (PPIs) both at a proteome wide scale and the level of cross-linked amino acid residues. However, error estimation at the level of PPI remains challenging in large scale datasets. Here we discuss recent advances in the recognition of spurious inter-protein peptide pairs and in diminishing the FDR for these PPI-signaling cross-links, such as the use of chromatographic retention time prediction, in order to come to a more reliable reporting of PPIs.
Collapse
Affiliation(s)
- Luitzen de Jong
- Swammerdam Institute for Life Sciences, Mass Spectrometry of Biomolecules, University of Amsterdam, Science Park 904, 1098 HX Amsterdam, the Netherlands.
| | - Winfried Roseboom
- Swammerdam Institute for Life Sciences, Mass Spectrometry of Biomolecules, University of Amsterdam, Science Park 904, 1098 HX Amsterdam, the Netherlands
| | - Gertjan Kramer
- Swammerdam Institute for Life Sciences, Mass Spectrometry of Biomolecules, University of Amsterdam, Science Park 904, 1098 HX Amsterdam, the Netherlands
| |
Collapse
|
10
|
Villacrés C, Spicer V, Krokhin OV. Confident Identification of Citrullination and Carbamylation Assisted by Peptide Retention Time Prediction. J Proteome Res 2021; 20:1571-1581. [PMID: 33523662 DOI: 10.1021/acs.jproteome.0c00775] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The chromatographic behavior of peptides carrying citrulline and homocitrulline residues in proteomic two-dimensional (2D) liquid chromatography-mass spectrometry (LC-MS) experiments has been investigated. The primary goal of this study was to determine the chromatographic conditions that allow differentiating between arginine citrullination and deamidation of asparagine based on retention data, improving the confidence of MS-based identifications. Carbamylation was used as a reference point due to a high degree of similarity between modification products and anticipated changes in chromatographic behavior. We applied 2D LC-MS/MS (a high-pH-low-pH reversed phase (RP), hydrophilic interaction liquid chromatography (HILIC)-low-pH RP, and strong cation exchange (SCX)-low-pH RP) to acquire retention data for modified-nonmodified peptide pairs in the four separation modes. Modifications of a standard protein mixture were induced enzymatically (PAD-2) or chemically (urea) for citrullination and carbamylation, respectively. Deamidation occurs spontaneously. Similar retention shifts were observed for all three modifications in a high-pH RP (decrease) and a low-pH RP (increase), thus limiting the applicability of this 2D LC combination. HILIC on bare silica and strong cation exchange separations have been probed to amplify the effect of charge loss upon citrullination, with SCX demonstrating the most differentiating power: the elimination of basic residues upon citrullination/carbamylation results in an ∼58 mM KCl retention decrease, while retention of deamidated products decreases slightly.
Collapse
Affiliation(s)
- Carina Villacrés
- Manitoba Centre for Proteomics and Systems Biology, Department of Internal Medicine, University of Manitoba, Winnipeg, MB R3E 3P4, Canada
| | - Victor Spicer
- Manitoba Centre for Proteomics and Systems Biology, Department of Internal Medicine, University of Manitoba, Winnipeg, MB R3E 3P4, Canada
| | - Oleg V Krokhin
- Manitoba Centre for Proteomics and Systems Biology, Department of Internal Medicine, University of Manitoba, Winnipeg, MB R3E 3P4, Canada.,Internal Medicine, University of Manitoba, Winnipeg, MB R3E 3P4, Canada
| |
Collapse
|
11
|
Chang CH, Chang HY, Rappsilber J, Ishihama Y. Isolation of Acetylated and Unmodified Protein N-Terminal Peptides by Strong Cation Exchange Chromatographic Separation of TrypN-Digested Peptides. Mol Cell Proteomics 2020; 20:100003. [PMID: 33517145 PMCID: PMC7857546 DOI: 10.1074/mcp.tir120.002148] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 10/22/2020] [Accepted: 11/02/2020] [Indexed: 12/25/2022] Open
Abstract
We developed a simple and rapid method to enrich protein N-terminal peptides, in which the protease TrypN is first employed to generate protein N-terminal peptides without Lys or Arg and internal peptides with two positive charges at their N termini, and then, the N-terminal peptides with or without N-acetylation are separated from the internal peptides by strong cation exchange chromatography according to a retention model based on the charge/orientation of peptides. This approach was applied to 20 μg of human HEK293T cell lysate proteins to profile the N-terminal proteome. On average, 1550 acetylated and 200 unmodified protein N-terminal peptides were successfully identified in a single LC/MS/MS run with less than 3% contamination with internal peptides, even when we accepted only canonical protein N termini registered in the Swiss-Prot database. Because this method involves only two steps, protein digestion and chromatographic separation, without the need for tedious chemical reactions, it should be useful for comprehensive profiling of protein N termini, including proteoforms with neo-N termini.
Collapse
Affiliation(s)
- Chih-Hsiang Chang
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
| | - Hsin-Yi Chang
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan; Graduate Institute of Metabolism and Obesity Sciences, Taipei Medical University, Taipei, Taiwan
| | - Juri Rappsilber
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan; Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany; Wellcome Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Yasushi Ishihama
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan; Laboratory of Clinical and Analytical Chemistry, National Institute of Biomedical Innovation, Health and Nutrition, Ibaraki, Osaka, Japan.
| |
Collapse
|
12
|
Wen B, Zeng W, Liao Y, Shi Z, Savage SR, Jiang W, Zhang B. Deep Learning in Proteomics. Proteomics 2020; 20:e1900335. [PMID: 32939979 PMCID: PMC7757195 DOI: 10.1002/pmic.201900335] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 09/14/2020] [Indexed: 12/17/2022]
Abstract
Proteomics, the study of all the proteins in biological systems, is becoming a data-rich science. Protein sequences and structures are comprehensively catalogued in online databases. With recent advancements in tandem mass spectrometry (MS) technology, protein expression and post-translational modifications (PTMs) can be studied in a variety of biological systems at the global scale. Sophisticated computational algorithms are needed to translate the vast amount of data into novel biological insights. Deep learning automatically extracts data representations at high levels of abstraction from data, and it thrives in data-rich scientific research domains. Here, a comprehensive overview of deep learning applications in proteomics, including retention time prediction, MS/MS spectrum prediction, de novo peptide sequencing, PTM prediction, major histocompatibility complex-peptide binding prediction, and protein structure prediction, is provided. Limitations and the future directions of deep learning in proteomics are also discussed. This review will provide readers an overview of deep learning and how it can be used to analyze proteomics data.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Wen‐Feng Zeng
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS)Chinese Academy of SciencesInstitute of Computing TechnologyBeijing100190China
| | - Yuxing Liao
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Zhiao Shi
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Sara R. Savage
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Wen Jiang
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| | - Bing Zhang
- Lester and Sue Smith Breast CenterBaylor College of MedicineHoustonTX77030USA
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTX77030USA
| |
Collapse
|
13
|
Proteomic Profiling of Emiliania huxleyi Using a Three-Dimensional Separation Method Combined with Tandem Mass Spectrometry. Molecules 2020; 25:molecules25133028. [PMID: 32630776 PMCID: PMC7411631 DOI: 10.3390/molecules25133028] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 06/30/2020] [Accepted: 06/30/2020] [Indexed: 12/31/2022] Open
Abstract
Emiliania huxleyi is one of the most abundant marine planktons, and it has a crucial feature in the carbon cycle. However, proteomic analyses of Emiliania huxleyi have not been done extensively. In this study, a three-dimensional liquid chromatography (3D-LC) system consisting of strong cation exchange, high- and low-pH reversed-phase liquid chromatography was established for in-depth proteomic profiling of Emiliania huxleyi. From tryptic proteome digest, 70 fractions were generated and analyzed using liquid chromatography-tandem mass spectrometry. In total, more than 84,000 unique peptides and 10,000 proteins groups were identified with a false discovery rate of ≤0.01. The physicochemical properties of the identified peptides were evaluated. Using ClueGO, approximately 700 gene ontology terms and 15 pathways were defined from the identified protein groups with p-value ≤0.05, covering a wide range of biological processes, cellular components, and molecular functions. Many biological processes associated with CO2 fixation, photosynthesis, biosynthesis, and metabolic process were identified. Various molecular functions relating to protein binding and enzyme activities were also found. The 3D-LC strategy is a powerful approach for comparative proteomic studies on Emiliania huxleyi to reveal changes in its protein level and related mechanism.
Collapse
|
14
|
Wen B, Li K, Zhang Y, Zhang B. Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis. Nat Commun 2020; 11:1759. [PMID: 32273506 PMCID: PMC7145864 DOI: 10.1038/s41467-020-15456-w] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 03/10/2020] [Indexed: 01/01/2023] Open
Abstract
Genomics-based neoantigen discovery can be enhanced by proteomic evidence, but there remains a lack of consensus on the performance of different quality control methods for variant peptide identification in proteogenomics. We propose to use the difference between accurately predicted and observed retention times for each peptide as a metric to evaluate different quality control methods. To this end, we develop AutoRT, a deep learning algorithm with high accuracy in retention time prediction. Analysis of three cancer data sets with a total of 287 tumor samples using different quality control strategies results in substantially different numbers of identified variant peptides and putative neoantigens. Our systematic evaluation, using the proposed retention time metric, provides insights and practical guidance on the selection of quality control strategies. We implement the recommended strategy in a computational workflow named NeoFlow to support proteogenomics-based neoantigen prioritization, enabling more sensitive discovery of putative neoantigens. Identifying mutation-derived neoantigens by proteogenomics requires robust strategies for quality control. Here, the authors propose peptide retention time as an evaluation metric for proteogenomics quality control methods, and develop a deep learning algorithm for accurate retention time prediction.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Yun Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA. .,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
15
|
Yeung D, Mizero B, Gussakovsky D, Klaassen N, Lao Y, Spicer V, Krokhin OV. Separation Orthogonality in Liquid Chromatography–Mass Spectrometry for Proteomic Applications: Comparison of 16 Different Two-Dimensional Combinations. Anal Chem 2020; 92:3904-3912. [DOI: 10.1021/acs.analchem.9b05407] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Darien Yeung
- Department of Biochemistry and Medical Genetics, University of Manitoba, 336 Basic Medical Sciences Building, 745 Bannatyne Avenue, Winnipeg, Manitoba R3E 0J9, Canada
| | - Benilde Mizero
- Department of Chemistry, University of Manitoba, 360 Parker Building, Winnipeg, Manitoba R3T 2N2, Canada
| | - Daniel Gussakovsky
- Department of Chemistry, University of Manitoba, 360 Parker Building, Winnipeg, Manitoba R3T 2N2, Canada
| | - Nicole Klaassen
- Department of Chemistry, University of Manitoba, 360 Parker Building, Winnipeg, Manitoba R3T 2N2, Canada
| | | | | | - Oleg V. Krokhin
- Department of Biochemistry and Medical Genetics, University of Manitoba, 336 Basic Medical Sciences Building, 745 Bannatyne Avenue, Winnipeg, Manitoba R3E 0J9, Canada
- Department of Chemistry, University of Manitoba, 360 Parker Building, Winnipeg, Manitoba R3T 2N2, Canada
| |
Collapse
|
16
|
Yeung D, Klaassen N, Mizero B, Spicer V, Krokhin OV. Peptide retention time prediction in hydrophilic interaction liquid chromatography: Zwitter-ionic sulfoalkylbetaine and phosphorylcholine stationary phases. J Chromatogr A 2020; 1619:460909. [PMID: 32007221 DOI: 10.1016/j.chroma.2020.460909] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 01/17/2020] [Accepted: 01/21/2020] [Indexed: 01/01/2023]
Abstract
Peptide retention time prediction models have been developed for zwitter-ionic ZIC-HILIC and ZIC-cHILIC stationary phases (pH 4.5 eluents) using proteomics-derived retention datasets of ~30 thousand tryptic peptides each. Overall, hydrophilicity of these stationary phases was found to be similar to the previously studied Amide HILIC phase, but lower compared to bare silicas. Peptide retention is driven by interactions of all charged (hydrophilic) residues at pH 4.5 (Asp, Glu, Arg, Lys, His), but shows specificity according to orientation of functional groups in zwitter-ionic pair. Thus, ZIC-cHILIC exhibits an increased contribution of negatively charged Asp and Glu due to the distal positioning of positively charged quaternary amines on the stationary phase. These findings confirm that HILIC interactions are driven by both peptide distribution between water layer adsorbed on the stationary phase and by interactions specific to functional groups of the packing material. Sequence-Specific Retention Calculator HILIC models were optimized for these columns showing 0.967-0.976 R2-values between experimental and predicted retention values. ZIC-HILIC separations represent a good choice as a first dimension in 2D LC-MS of peptide mixtures with correlations between retention values of ZIC-HILIC against RPLC found at 0.197 (ZIC-HILIC) and 0.137 (ZIC-cHILIC) R2-values, confirming a good orthogonality.
Collapse
Affiliation(s)
- Darien Yeung
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Biochemistry and Medical Genetics, University of Manitoba, 336 BMSB, 745 Bannatyne Avenue, Winnipeg, R3E 0J9, Canada
| | - Nicole Klaassen
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada
| | - Benilde Mizero
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada
| | - Victor Spicer
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada
| | - Oleg V Krokhin
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Biochemistry and Medical Genetics, University of Manitoba, 336 BMSB, 745 Bannatyne Avenue, Winnipeg, R3E 0J9, Canada; Department of Chemistry, University of Manitoba, 360 Parker Building, 144 Dysart Road, Winnipeg, R3T 2N2, Canada; Department of Internal Medicine, University of Manitoba, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada.
| |
Collapse
|
17
|
Abstract
In bottom-up proteomics, proteins are typically identified by enzymatic digestion into peptides, tandem mass spectrometry and comparison of the tandem mass spectra with those predicted from a sequence database for peptides within measurement uncertainty from the experimentally obtained mass. Although now decreasingly common, isolated proteins or simple protein mixtures can also be identified by measuring only the masses of the peptides resulting from the enzymatic digest, without any further fragmentation. Separation methods such as liquid chromatography and electrophoresis are often used to fractionate complex protein or peptide mixtures prior to analysis by mass spectrometry. Although the primary reason for this is to avoid ion suppression and improve data quality, these separations are based on physical and chemical properties of the peptides or proteins and therefore also provide information about them. Depending on the separation method, this could be protein molecular weight (SDS-PAGE), isoelectric point (IEF), charge at a known pH (ion exchange chromatography), or hydrophobicity (reversed phase chromatography). These separations produce approximate measurements on properties that to some extent can be predicted from amino acid sequences. In the case of molecular weight of proteins without posttranslational modifications this is straightforward: simply add the molecular weights of the amino acid residues in the protein. For IEF, charge and hydrophobicity, the order of the amino acids, and folding state of the peptide or protein also matter, but it is nevertheless possible to predict the behavior of peptides and proteins in these separation methods to a degree which renders such predictions useful. This chapter reviews the topic of using data from separation methods for identification and validation in proteomics, with special emphasis on predicting retention times of tryptic peptides in reversed-phase chromatography under acidic conditions, as this is one of the most commonly used separation methods in bottom-up proteomics.
Collapse
|
18
|
Xiao W, Zhang J, Wang Y, Liu Z, Wang F, Sun J, Chang L, Xia Z, Li Y, Xu P. Ac-LysargiNase Complements Trypsin for the Identification of Ubiquitinated Sites. Anal Chem 2019; 91:15890-15898. [DOI: 10.1021/acs.analchem.9b04340] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Weidi Xiao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Junling Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Yihao Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Zijuan Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
- School of Basic Medical Science, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, P. R. China
| | - Fuqiang Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Jinshuai Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Zongping Xia
- Translational Medicine Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan 450018, P. R. China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug, Institute of Lifeomics, Beijing 102206, P. R. China
- School of Basic Medical Science, Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, P. R. China
- Guizhou University School of Medicine, Guiyang 550025, P.R. China
- Second Clinical Medicine Collage, Guangzhou University Chinese Medicine, Guangzhou 510006, P. R. China
| |
Collapse
|
19
|
Ang E, Neustaeter H, Spicer V, Perreault H, Krokhin O. Retention Time Prediction for Glycopeptides in Reversed-Phase Chromatography for Glycoproteomic Applications. Anal Chem 2019; 91:13360-13366. [PMID: 31566965 DOI: 10.1021/acs.analchem.9b02584] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The sequence-specific retention calculator algorithm (SSRCalc) [ Krokhin , O. V. Anal. Chem. 2006 , 78 , 7785 ] was adapted for the prediction of retention times of N-glycopeptides separated by reversed-phase high performance liquid chromatography (RPLC). The retention time shifts (dHI = HIglyco - HIdeglyco, where HI is the hydrophobicity index, measured in percent acetonitrile units) used for modeling were measured for 602 glycopeptides versus 123 of their deglycosylated analogues. Our method used a tryptic digest of 12 purified glycoproteins, glycopeptide enrichment, deglycosylation with PNGaseF, and RPLC-MS/MS analysis of combined (deglycosylated and intact) peptide mixtures. On average, glycosylation yields a 0.79% acetonitrile unit decrease in retention, compared with the hydrophobicity indices of their deglycosylated analogues. These values, however, are drastically different for asialo (-1.37% acetonitrile units), monosialylated (-0.47% acetonitrile units), disialylated (+0.61% acetonitrile units), and trisialylated (+1.94% acetonitrile units) glycans. Peptide retention time shifts upon glycosylation (dHI) vary depending on the number of monosaccharide units, the presence or absence of sialic acid, peptide hydrophobicity, and the number of position-dependent features. These features are mostly driven by competing effects of acidic residues (aspartic acid and sialic acid) on ion-pair formation and by nearest-neighbor effects of hydrophilic glycans. The accuracy of the modified prediction model for glycopeptides approaches that of the prediction for nonmodified species (R2 = 0.97 vs 0.98). However, retention time prediction based on the experimental retention values of deglycosylated analogues (HIglyco = HIdeglyco + dHI, R2 = 0.995) is much more accurate, thus providing a solid support for glycopeptide identification in complex samples based on mass and retention time.
Collapse
Affiliation(s)
- Evelyn Ang
- Chemistry Department , University of Manitoba , Winnipeg , Manitoba R3T 2N2 , Canada.,Department of Internal Medicine , University of Manitoba , Winnipeg , Manitoba R3A 1R9 , Canada
| | - Haley Neustaeter
- Department of Internal Medicine , University of Manitoba , Winnipeg , Manitoba R3A 1R9 , Canada
| | - Vic Spicer
- Department of Internal Medicine , University of Manitoba , Winnipeg , Manitoba R3A 1R9 , Canada
| | - Hélène Perreault
- Chemistry Department , University of Manitoba , Winnipeg , Manitoba R3T 2N2 , Canada
| | - Oleg Krokhin
- Chemistry Department , University of Manitoba , Winnipeg , Manitoba R3T 2N2 , Canada.,Department of Internal Medicine , University of Manitoba , Winnipeg , Manitoba R3A 1R9 , Canada.,Manitoba Centre for Proteomics and Systems Biology , University of Manitoba , Winnipeg , Manitoba R3E 3P4 , Canada
| |
Collapse
|
20
|
Klaassen N, Spicer V, Krokhin OV. Universal retention standard for peptide separations using various modes of high-performance liquid chromatography. J Chromatogr A 2018; 1588:163-168. [PMID: 30626502 DOI: 10.1016/j.chroma.2018.12.057] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 12/20/2018] [Accepted: 12/24/2018] [Indexed: 10/27/2022]
Abstract
Peptide retention standards are widely used by chromatography specialists. They can be used for quality control of peptide separations (separation efficiency, selectivity, retention values) and for accurate concatenation of retention data from multiple acquisitions in proteomics. So far the repertoire of available retention standards is mostly limited to reversed-phase separations. We introduce a synthetic peptide mixture which can be used in conjunction with the most popular peptide separation techniques: reversed-phase (RPLC), strong-cation exchange (SCX), (strong-anion exchange) SAX and hydrophilic interaction liquid chromatography (HILIC). Target sequences were first designed in-silico using Sequence-Specific Retention Calculator models covering all major peptide separation mechanisms. Peptides were also designed while keeping in mind the simplicity of retention time assignment using MS detection: they all have nearly identical masses and identical intense y3 fragment ions. This contribution demonstrates the application of this mixture for characterization of eight HILIC as well as SAX, SCX and C18 columns.
Collapse
Affiliation(s)
- Nicole Klaassen
- Department of Chemistry, University of Manitoba, 360 Parker Building, Winnipeg, Manitoba R3T 2N2, Canada
| | - Victor Spicer
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada
| | - Oleg V Krokhin
- Manitoba Centre for Proteomics and Systems Biology, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada; Department of Internal Medicine, University of Manitoba, 799 JBRC, 715 McDermot Avenue, Winnipeg, R3E 3P4, Canada.
| |
Collapse
|
21
|
Yuan H, Jiang B, Zhao B, Zhang L, Zhang Y. Recent Advances in Multidimensional Separation for Proteome Analysis. Anal Chem 2018; 91:264-276. [DOI: 10.1021/acs.analchem.8b04894] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Huiming Yuan
- Key Laboratory of Separation Science for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian, Liaoning 116023, China
| | - Bo Jiang
- Key Laboratory of Separation Science for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian, Liaoning 116023, China
| | - Baofeng Zhao
- Key Laboratory of Separation Science for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian, Liaoning 116023, China
| | - Lihua Zhang
- Key Laboratory of Separation Science for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian, Liaoning 116023, China
| | - Yukui Zhang
- Key Laboratory of Separation Science for Analytical Chemistry, National Chromatographic Research and Analysis Center, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian, Liaoning 116023, China
| |
Collapse
|
22
|
Pei D, Xi XJ, Huang XY, Quan KJ, Wei JT, Wang NL, Di DL. Isolation of high-purity peptide Val-Val-Tyr-Pro from Globin Peptide using MCI gel column combined with high-speed counter-current chromatography. J Sep Sci 2018; 41:4559-4566. [PMID: 30358082 DOI: 10.1002/jssc.201800972] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 10/12/2018] [Accepted: 10/18/2018] [Indexed: 12/21/2022]
Abstract
Peptides have gained increased interest over the past several decades because of their therapeutics. In this research, a strategy combining MCI gel column chromatography and high-speed countercurrent chromatography was developed for the separation of high-purity peptide Val-Val-Tyr-Pro from Globin Peptide. First, the fraction of Val-Val-Tyr-Pro mixtures with a purity of 15.8% was obtained by using MCI gel column with a mixture of ethanol/water (20:80, v/v/v). Then, the high-purity Val-Val-Tyr-Pro was separated by high-speed countercurrent chromatography with a aqueous two phase systems of ethanol/acetonitrile/iso-propyl alcohol/(NH4 )2 SO4 Saturated solution /H2 O (0.5:0.5:0.25:1.5:0.7,v/v). The ammonium sulfate from high-speed countercurrent chromatography fractions was removed from target compound by MCI gel column chromatography using ethanol/water in stepwise elution mode. A 78 mg of Val-Val-Tyr-Pro was successfully purified with the purities of 98.80% from 30 g crude Globin Peptide. The amino acid sequence of the Val-Val-Tyr-Pro was determined by electrospray ionization high resolution tandem mass spectrometry. The method presents a practical strategy for the large-scale separation of pure peptide Val-Val-Tyr-Pro from Globin Peptide, and provides a reference method for obtaining high-purity peptide from other polypeptide mixtures.
Collapse
Affiliation(s)
- Dong Pei
- CAS Key Laboratory of Chemistry of Northwestern Plant Resources and Key Laboratory for Natural Medicine of Gansu Province, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou, Gansu, P. R. China.,Center of Resource Chemical & New Material, Qingdao, Shandong, P. R. China
| | - Xing-Jun Xi
- Food and Agriculture Standardization Institute, China National Institute of Standardization, Beijing, P. R. China
| | - Xin-Yi Huang
- CAS Key Laboratory of Chemistry of Northwestern Plant Resources and Key Laboratory for Natural Medicine of Gansu Province, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou, Gansu, P. R. China
| | - Kai-Jun Quan
- CAS Key Laboratory of Chemistry of Northwestern Plant Resources and Key Laboratory for Natural Medicine of Gansu Province, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou, Gansu, P. R. China.,University of Chinese Academy of Sciences, Beijing, P. R. China
| | - Jan-Teng Wei
- CAS Key Laboratory of Chemistry of Northwestern Plant Resources and Key Laboratory for Natural Medicine of Gansu Province, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou, Gansu, P. R. China.,Center of Resource Chemical & New Material, Qingdao, Shandong, P. R. China
| | - Ning-Li Wang
- CAS Key Laboratory of Chemistry of Northwestern Plant Resources and Key Laboratory for Natural Medicine of Gansu Province, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou, Gansu, P. R. China.,Center of Resource Chemical & New Material, Qingdao, Shandong, P. R. China
| | - Duo-Long Di
- CAS Key Laboratory of Chemistry of Northwestern Plant Resources and Key Laboratory for Natural Medicine of Gansu Province, Lanzhou Institute of Chemical Physics, Chinese Academy of Sciences, Lanzhou, Gansu, P. R. China
| |
Collapse
|
23
|
Huang T, Armbruster MR, Coulton JB, Edwards JL. Chemical Tagging in Mass Spectrometry for Systems Biology. Anal Chem 2018; 91:109-125. [DOI: 10.1021/acs.analchem.8b04951] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Tianjiao Huang
- Department of Chemistry, Saint Louis University, 3501 Laclede Avenue, St. Louis, Missouri 63103, United States
| | - Michael R. Armbruster
- Department of Chemistry, Saint Louis University, 3501 Laclede Avenue, St. Louis, Missouri 63103, United States
| | - John B. Coulton
- Department of Chemistry, Saint Louis University, 3501 Laclede Avenue, St. Louis, Missouri 63103, United States
| | - James L. Edwards
- Department of Chemistry, Saint Louis University, 3501 Laclede Avenue, St. Louis, Missouri 63103, United States
| |
Collapse
|
24
|
Ma C, Ren Y, Yang J, Ren Z, Yang H, Liu S. Improved Peptide Retention Time Prediction in Liquid Chromatography through Deep Learning. Anal Chem 2018; 90:10881-10888. [PMID: 30114359 DOI: 10.1021/acs.analchem.8b02386] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The accuracy of peptide retention time (RT) prediction model in liquid chromatography (LC) is still not sufficient for wider implementation in proteomics practice. Herein, we propose deep learning as an ideal tool to considerably improve this prediction. A new peptide RT prediction tool, DeepRT, was designed using a capsule network model, and the public data sets containing peptides separated by reverse-phase liquid chromatography were used to evaluate the DeepRT performance. Compared with other prevailing RT predictors, DeepRT attained overall improvement in the prediction of peptide RTs with an R2 of ∼0.994. Moreover, DeepRT was able to accommodate to the peptides that were separated by different types of LC, such as strong cation exchange (SCX) and hydrophilic interaction liquid chromatography (HILIC) and to reach the RT prediction with R2 values of ∼0.996 for SCX and ∼0.993 for HILIC, respectively. If a large peptide data set is available for one type of LC, DeepRT can be promoted to DeepRT(+) using transfer learning. Based on a large peptide data set gained from SWATH, DeepRT(+) further elevated the accuracy of RT prediction for peptides in a small data set and enabled a satisfactory prediction upon limited peptides approximating hundreds. Further, DeepRT automatically learns retention-related properties of amino acids under different separation mechanisms, which are well consistent with retention coefficients (Rc) of the amino acids. DeepRT was thus proven to be an improved RT predictor with high flexibility and efficiency. DeepRT is available at https://github.com/horsepurve/DeepRTplus .
Collapse
Affiliation(s)
- Chunwei Ma
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Yan Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Jiarui Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Zhe Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| | - Huanming Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,James D. Watson Institute of Genome Sciences , Hangzhou 310008 , China
| | - Siqi Liu
- BGI-Shenzhen , Beishan Industrial Zone 11th Building, Yantian District, Shenzhen , Guangdong 518083 , China.,China National GeneBank , BGI-Shenzhen , Shenzhen 518120 , China
| |
Collapse
|
25
|
Giese SH, Ishihama Y, Rappsilber J. Peptide Retention in Hydrophilic Strong Anion Exchange Chromatography Is Driven by Charged and Aromatic Residues. Anal Chem 2018. [PMID: 29528219 PMCID: PMC5937359 DOI: 10.1021/acs.analchem.7b05157] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Hydrophilic strong anion exchange chromatography (hSAX) is becoming a popular method for the prefractionation of proteomic samples. However, the use and further development of this approach is affected by the limited understanding of its retention mechanism and the absence of elution time prediction. Using a set of 59 297 confidentially identified peptides, we performed an explorative analysis and built a predictive deep learning model. As expected, charged residues are the major contributors to the retention time through electrostatic interactions. Aspartic acid and glutamic acid have a strong retaining effect and lysine and arginine have a strong repulsion effect. In addition, we also find the involvement of aromatic amino acids. This suggests a substantial contribution of cation-π interactions to the retention mechanism. The deep learning approach was validated using 5-fold cross-validation (CV) yielding a mean prediction accuracy of 70% during CV and 68% on a hold-out validation set. The results of this study emphasize that not only electrostatic interactions but rather diverse types of interactions must be integrated to build a reliable hSAX retention time predictor.
Collapse
Affiliation(s)
- Sven H Giese
- Bioanalytics, Institute of Biotechnology , Technische Universität Berlin , 13355 Berlin , Germany
| | - Yasushi Ishihama
- Graduate School of Pharmaceutical Sciences , Kyoto University , Kyoto 606-8501 , Japan
| | - Juri Rappsilber
- Bioanalytics, Institute of Biotechnology , Technische Universität Berlin , 13355 Berlin , Germany.,Graduate School of Pharmaceutical Sciences , Kyoto University , Kyoto 606-8501 , Japan.,Wellcome Centre for Cell Biology, School of Biological Sciences , University of Edinburgh , Edinburgh EH9 3BF , United Kingdom
| |
Collapse
|
26
|
Spicer V, Krokhin OV. Peptide retention time prediction in hydrophilic interaction liquid chromatography. Comparison of separation selectivity between bare silica and bonded stationary phases. J Chromatogr A 2018; 1534:75-84. [DOI: 10.1016/j.chroma.2017.12.046] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Revised: 12/14/2017] [Accepted: 12/16/2017] [Indexed: 01/01/2023]
|