1
|
Ng CCA, Zhou Y, Yao ZP. Algorithms for de-novo sequencing of peptides by tandem mass spectrometry: A review. Anal Chim Acta 2023; 1268:341330. [PMID: 37268337 DOI: 10.1016/j.aca.2023.341330] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 05/04/2023] [Accepted: 05/06/2023] [Indexed: 06/04/2023]
Abstract
Peptide sequencing is of great significance to fundamental and applied research in the fields such as chemical, biological, medicinal and pharmaceutical sciences. With the rapid development of mass spectrometry and sequencing algorithms, de-novo peptide sequencing using tandem mass spectrometry (MS/MS) has become the main method for determining amino acid sequences of novel and unknown peptides. Advanced algorithms allow the amino acid sequence information to be accurately obtained from MS/MS spectra in short time. In this review, algorithms from exhaustive search to the state-of-art machine learning and neural network for high-throughput and automated de-novo sequencing are introduced and compared. Impacts of datasets on algorithm performance are highlighted. The current limitations and promising direction of de-novo peptide sequencing are also discussed in this review.
Collapse
Affiliation(s)
- Cheuk Chi A Ng
- State Key Laboratory of Chemical Biology and Drug Discovery, and Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; Research Institute for Future Food, and Research Center for Chinese Medicine Innovation, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), and Shenzhen Key Laboratory of Food Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518057, China
| | - Yin Zhou
- State Key Laboratory of Chemical Biology and Drug Discovery, and Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; Research Institute for Future Food, and Research Center for Chinese Medicine Innovation, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), and Shenzhen Key Laboratory of Food Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518057, China
| | - Zhong-Ping Yao
- State Key Laboratory of Chemical Biology and Drug Discovery, and Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; Research Institute for Future Food, and Research Center for Chinese Medicine Innovation, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong Special Administrative Region of China; State Key Laboratory of Chinese Medicine and Molecular Pharmacology (Incubation), and Shenzhen Key Laboratory of Food Biological Safety Control, The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, 518057, China.
| |
Collapse
|
2
|
Oomens J, Kempkes LJM, Geurts TPJ, van Dijk L, Martens J, Berden G, Armentrout PB. Water Loss from Protonated XxxSer and XxxThr Dipeptides Gives Oxazoline-Not Oxazolone-Product Ions. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2020; 31:2111-2123. [PMID: 32876444 PMCID: PMC7552115 DOI: 10.1021/jasms.0c00239] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 08/31/2020] [Accepted: 09/02/2020] [Indexed: 06/11/2023]
Abstract
Neutral loss of water and ammonia are often significant fragmentation channels upon collisional activation of protonated peptides. Here, we deploy infrared ion spectroscopy to investigate the dehydration reactions of protonated AlaSer, AlaThr, GlySer, GlyThr, PheSer, PheThr, ProSer, ProThr, AsnSer, and AsnThr, focusing on the question of the structure of the resulting [M + H - H2O]+ fragment ion and the site from which H2O is expelled. In all cases, the second residue of the selected peptides contains a hydroxyl moiety, so that H2O loss can potentially occur from this side-chain, as an alternative to loss from the C-terminal free acid of the dipeptide. Infrared action spectra of the product ions along with quantum-chemical calculations unambiguously show that dehydration consistently produces fragment ions containing an oxazoline moiety. This contrasts with the common oxazolone structure that would result from dehydration at the C-terminus analogous to the common b/y dissociation forming regular b2-type sequence ions. The oxazoline product structure suggests a reaction mechanism involving water loss from the Ser/Thr side-chain with concomitant nucleophilic attack of the amide carbonyl oxygen at its β-carbon, forming an oxazoline ring. However, an extensive quantum-chemical investigation comparing the potential energy surfaces for three entirely different dehydration reaction pathways indicates that it is actually the backbone amide oxygen atom that leaves as the water molecule.
Collapse
Affiliation(s)
- Jos Oomens
- FELIX Laboratory, Institute of Molecules and
Materials, Radboud University, Toernooiveld 7, 6525 ED
Nijmegen, The Netherlands
- Van’t Hoff Institute for Molecular Sciences,
University of Amsterdam, Science Park 904, 1098 XH Amsterdam,
The Netherlands
| | - Lisanne J. M. Kempkes
- FELIX Laboratory, Institute of Molecules and
Materials, Radboud University, Toernooiveld 7, 6525 ED
Nijmegen, The Netherlands
| | - Thijs P. J. Geurts
- FELIX Laboratory, Institute of Molecules and
Materials, Radboud University, Toernooiveld 7, 6525 ED
Nijmegen, The Netherlands
| | - Luuk van Dijk
- FELIX Laboratory, Institute of Molecules and
Materials, Radboud University, Toernooiveld 7, 6525 ED
Nijmegen, The Netherlands
| | - Jonathan Martens
- FELIX Laboratory, Institute of Molecules and
Materials, Radboud University, Toernooiveld 7, 6525 ED
Nijmegen, The Netherlands
| | - Giel Berden
- FELIX Laboratory, Institute of Molecules and
Materials, Radboud University, Toernooiveld 7, 6525 ED
Nijmegen, The Netherlands
| | - P. B. Armentrout
- Department of Chemistry, University of
Utah, 315 South 1400 East, Room 2020, Salt Lake City, Utah 84112,
United States
| |
Collapse
|
3
|
Kuchibhotla B, Kola SR, Medicherla JV, Cherukuvada SV, Dhople VM, Nalam MR. Combinatorial Labeling Method for Improving Peptide Fragmentation in Mass Spectrometry. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2017; 28:1216-1226. [PMID: 28349438 DOI: 10.1007/s13361-017-1606-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Revised: 12/30/2016] [Accepted: 01/09/2017] [Indexed: 06/06/2023]
Abstract
Annotation of peptide sequence from tandem mass spectra constitutes the central step of mass spectrometry-based proteomics. Peptide mass spectra are obtained upon gas-phase fragmentation. Identification of the protein from a set of experimental peptide spectral matches is usually referred as protein inference. Occurrence and intensity of these fragment ions in the MS/MS spectra are dependent on many factors such as amino acid composition, peptide basicity, activation mode, protease, etc. Particularly, chemical derivatizations of peptides were known to alter their fragmentation. In this study, the influence of acetylation, guanidinylation, and their combination on peptide fragmentation was assessed initially on a lipase (LipA) from Bacillus subtilis followed by a bovine six protein mix digest. The dual modification resulted in improved fragment ion occurrence and intensity changes, and this resulted in the equivalent representation of b- and y-type fragment ions in an ion trap MS/MS spectrum. The improved representation has allowed us to accurately annotate the peptide sequences de novo. Dual labeling has significantly reduced the false positive protein identifications in standard bovine six peptide digest. Our study suggests that the combinatorial labeling of peptides is a useful method to validate protein identifications for high confidence protein inference. Graphical Abstract ᅟ.
Collapse
Affiliation(s)
- Bhanuramanand Kuchibhotla
- Center for Cellular and Molecular Biology, Council of Scientific and Industrial Research, Uppal Road, Hyderabad, 500007, Telangana, India
| | - Sankara Rao Kola
- Center for Cellular and Molecular Biology, Council of Scientific and Industrial Research, Uppal Road, Hyderabad, 500007, Telangana, India
| | - Jagannadham V Medicherla
- Center for Cellular and Molecular Biology, Council of Scientific and Industrial Research, Uppal Road, Hyderabad, 500007, Telangana, India
| | - Swamy V Cherukuvada
- Center for Cellular and Molecular Biology, Council of Scientific and Industrial Research, Uppal Road, Hyderabad, 500007, Telangana, India
| | - Vishnu M Dhople
- Department of Functional Genomics, University Medicine Greifswald, Interface Institute Genetics & Functional Genomics, D-17475, Greifswald, Germany
| | - Madhusudhana Rao Nalam
- Center for Cellular and Molecular Biology, Council of Scientific and Industrial Research, Uppal Road, Hyderabad, 500007, Telangana, India.
| |
Collapse
|
4
|
Stover ML, Plummer CE, Miller SR, Cassady CJ, Dixon DA. Gas-Phase Acidities of Phosphorylated Amino Acids. J Phys Chem B 2015; 119:14604-21. [DOI: 10.1021/acs.jpcb.5b08616] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Michele L. Stover
- Chemistry
Department, Shelby
Hall, The University of Alabama, Shelby Hall, Box
870336, Tuscaloosa, Alabama 35487-0336, United States
| | - Chelsea E. Plummer
- Chemistry
Department, Shelby
Hall, The University of Alabama, Shelby Hall, Box
870336, Tuscaloosa, Alabama 35487-0336, United States
| | - Sean R. Miller
- Chemistry
Department, Shelby
Hall, The University of Alabama, Shelby Hall, Box
870336, Tuscaloosa, Alabama 35487-0336, United States
| | - Carolyn J. Cassady
- Chemistry
Department, Shelby
Hall, The University of Alabama, Shelby Hall, Box
870336, Tuscaloosa, Alabama 35487-0336, United States
| | - David A. Dixon
- Chemistry
Department, Shelby
Hall, The University of Alabama, Shelby Hall, Box
870336, Tuscaloosa, Alabama 35487-0336, United States
| |
Collapse
|
5
|
Medzihradszky KF, Chalkley RJ. Lessons in de novo peptide sequencing by tandem mass spectrometry. MASS SPECTROMETRY REVIEWS 2015; 34:43-63. [PMID: 25667941 PMCID: PMC4367481 DOI: 10.1002/mas.21406] [Citation(s) in RCA: 148] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Mass spectrometry has become the method of choice for the qualitative and quantitative characterization of protein mixtures isolated from all kinds of living organisms. The raw data in these studies are MS/MS spectra, usually of peptides produced by proteolytic digestion of a protein. These spectra are "translated" into peptide sequences, normally with the help of various search engines. Data acquisition and interpretation have both been automated, and most researchers look only at the summary of the identifications without ever viewing the underlying raw data used for assignments. Automated analysis of data is essential due to the volume produced. However, being familiar with the finer intricacies of peptide fragmentation processes, and experiencing the difficulties of manual data interpretation allow a researcher to be able to more critically evaluate key results, particularly because there are many known rules of peptide fragmentation that are not incorporated into search engine scoring. Since the most commonly used MS/MS activation method is collision-induced dissociation (CID), in this article we present a brief review of the history of peptide CID analysis. Next, we provide a detailed tutorial on how to determine peptide sequences from CID data. Although the focus of the tutorial is de novo sequencing, the lessons learned and resources supplied are useful for data interpretation in general.
Collapse
|
6
|
Lanucara F, Chi Hoo Lee D, Eyers CE. Unblocking the sink: improved CID-based analysis of phosphorylated peptides by enzymatic removal of the basic C-terminal residue. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2014; 25:214-25. [PMID: 24297471 PMCID: PMC3899453 DOI: 10.1007/s13361-013-0770-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Revised: 10/08/2013] [Accepted: 10/08/2013] [Indexed: 06/02/2023]
Abstract
A one-step enzymatic reaction for improving the collision-induced dissociation (CID)-based tandem mass spectrometry (MS/MS) analysis of phosphorylated peptides in an ion trap is presented. Carboxypeptidase-B (CBP-B) was used to selectively remove C-terminal arginine or lysine residues from phosphorylated tryptic/Lys-C peptides prior to their MS/MS analysis by CID with a Paul-type ion trap. Removal of this basic C-terminal residue served to limit the extent of gas-phase neutral loss of phosphoric acid (H3PO4), favoring the formation of diagnostic b and y ions as determined by an increase in both the number and relative intensities of the sequence-specific product ions. Such differential fragmentation is particularly valuable when the H3PO4 elimination is so predominant that localizing the phosphorylation site on the peptide sequence is hindered. Improvement in the quality of tandem mass spectral data generated by CID upon CBP-B treatment resulted in greater confidence both in assignment of the phosphopeptide primary sequence and for pinpointing the site of phosphorylation. Higher Mascot ion scores were also generated, combined with lower expectation values and higher delta scores for improved confidence in site assignment; Ascore values also improved. These results are rationalized in accordance with the accepted mechanisms for the elimination of H3PO4 upon low energy CID and insights into the factors dictating the observed dissociation pathways are presented. We anticipate this approach will be of utility in the MS analysis of phosphorylated peptides, especially when alternative electron-driven fragmentation techniques are not available.
Collapse
Affiliation(s)
- Francesco Lanucara
- Manchester Institute of Biotechnology, Michael Barber Centre for Mass Spectrometry, School of Chemistry, University of Manchester, Manchester, M1 7DN UK
- Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB UK
| | - Dave Chi Hoo Lee
- Manchester Institute of Biotechnology, Michael Barber Centre for Mass Spectrometry, School of Chemistry, University of Manchester, Manchester, M1 7DN UK
| | - Claire E. Eyers
- Manchester Institute of Biotechnology, Michael Barber Centre for Mass Spectrometry, School of Chemistry, University of Manchester, Manchester, M1 7DN UK
- Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB UK
| |
Collapse
|