1
|
Korchak J, Jeffery ED, Bandyopadhyay S, Jordan BT, Lehe MD, Watts EF, Fenix A, Wilhelm M, Sheynkman GM. IS-PRM-Based Peptide Targeting Informed by Long-Read Sequencing for Alternative Proteome Detection. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:2614-2630. [PMID: 39012054 PMCID: PMC11544703 DOI: 10.1021/jasms.4c00119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 05/24/2024] [Accepted: 06/25/2024] [Indexed: 07/17/2024]
Abstract
Alternative splicing is a major contributor of transcriptomic complexity, but the extent to which transcript isoforms are translated into stable, functional protein isoforms is unclear. Furthermore, detection of relatively scarce isoform-specific peptides is challenging, with many protein isoforms remaining uncharted due to technical limitations. Recently, a family of advanced targeted MS strategies, termed internal standard parallel reaction monitoring (IS-PRM), have demonstrated multiplexed, sensitive detection of predefined peptides of interest. Such approaches have not yet been used to confirm existence of novel peptides. Here, we present a targeted proteogenomic approach that leverages sample-matched long-read RNA sequencing (lrRNA-seq) data to predict potential protein isoforms with prior transcript evidence. Predicted tryptic isoform-specific peptides, which are specific to individual gene product isoforms, serve as "triggers" and "targets" in the IS-PRM method, Tomahto. Using the model human stem cell line WTC11, LR RNaseq data were generated and used to inform the generation of synthetic standards for 192 isoform-specific peptides (114 isoforms from 55 genes). These synthetic "trigger" peptides were labeled with super heavy tandem mass tags (TMT) and spiked into TMT-labeled WTC11 tryptic digest, predicted to contain corresponding endogenous "target" peptides. Compared to DDA mode, Tomahto increased detectability of isoforms by 3.6-fold, resulting in the identification of five previously unannotated isoforms. Our method detected protein isoform expression for 43 out of 55 genes corresponding to 54 resolved isoforms. This lrRNA-seq-informed Tomahto targeted approach is a new modality for generating protein-level evidence of alternative isoforms─a critical first step in designing functional studies and eventually clinical assays.
Collapse
Affiliation(s)
- Jennifer
A. Korchak
- Department
of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Erin D. Jeffery
- Department
of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Saikat Bandyopadhyay
- Department
of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
- Center
for Public Health Genomics, University of
Virginia, Charlottesville, Virginia 22903, United States
| | - Ben T. Jordan
- Cancer
Genomics Research Laboratory, Frederick
National Laboratory for Cancer Research, Frederick, Maryland 21701, United States
| | - Micah D. Lehe
- Department
of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Emily F. Watts
- Department
of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
| | - Aidan Fenix
- Department
of Laboratory Medicine and Pathology, University
of Washington, Seattle, Washington 98195, United States
| | - Mathias Wilhelm
- Computational
Mass Spectrometry, Technical University
of Munich (TUM), D-85354 Freising, Germany
| | - Gloria M. Sheynkman
- Department
of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia 22903, United States
- Department
of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, Virginia 22903, United States
- UVA
Comprehensive Cancer Center, University
of Virginia, Charlottesville, Virginia 22903, United States
| |
Collapse
|
2
|
Wen B, Hsu C, Zeng WF, Riffle M, Chang A, Mudge M, Nunn B, Berg MD, Villén J, MacCoss MJ, Noble WS. Carafe enables high quality in silico spectral library generation for data-independent acquisition proteomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.15.618504. [PMID: 39463980 PMCID: PMC11507862 DOI: 10.1101/2024.10.15.618504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
Data-independent acquisition (DIA)-based mass spectrometry is becoming an increasingly popular mass spectrometry acquisition strategy for carrying out quantitative proteomics experiments. Most of the popular DIA search engines make use of in silico generated spectral libraries. However, the generation of high-quality spectral libraries for DIA data analysis remains a challenge, particularly because most such libraries are generated directly from data-dependent acquisition (DDA) data or are from in silico prediction using models trained on DDA data. In this study, we developed Carafe, a tool that generates high-quality experiment-specific in silico spectral libraries by training deep learning models directly on DIA data. We demonstrate the performance of Carafe on a wide range of DIA datasets, where we observe improved fragment ion intensity prediction and peptide detection relative to existing pretrained DDA models.
Collapse
Affiliation(s)
- Bo Wen
- Department of Genome Sciences, University of Washington
| | - Chris Hsu
- Department of Genome Sciences, University of Washington
| | - Wen-Feng Zeng
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Germany
| | | | - Alexis Chang
- Department of Genome Sciences, University of Washington
| | - Miranda Mudge
- Department of Genome Sciences, University of Washington
| | - Brook Nunn
- Department of Genome Sciences, University of Washington
| | | | - Judit Villén
- Department of Genome Sciences, University of Washington
| | | | - William S. Noble
- Department of Genome Sciences, University of Washington
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| |
Collapse
|
3
|
Petrovskiy DV, Nikolsky KS, Kulikova LI, Rudnev VR, Butkova TV, Malsagova KA, Kopylov AT, Kaysheva AL. PowerNovo: de novo peptide sequencing via tandem mass spectrometry using an ensemble of transformer and BERT models. Sci Rep 2024; 14:15000. [PMID: 38951578 PMCID: PMC11217302 DOI: 10.1038/s41598-024-65861-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Accepted: 06/25/2024] [Indexed: 07/03/2024] Open
Abstract
The primary objective of analyzing the data obtained in a mass spectrometry-based proteomic experiment is peptide and protein identification, or correct assignment of the tandem mass spectrum to one amino acid sequence. Comparison of empirical fragment spectra with the theoretical predicted one or matching with the collected spectra library are commonly accepted strategies of proteins identification and defining of their amino acid sequences. Although these approaches are widely used and are appreciably efficient for the well-characterized model organisms or measured proteins, they cannot detect novel peptide sequences that have not been previously annotated or are rare. This study presents PowerNovo tool for de novo sequencing of proteins using tandem mass spectra acquired in a variety of types of mass analyzers and different fragmentation techniques. PowerNovo involves an ensemble of models for peptide sequencing: model for detecting regularities in tandem mass spectra, precursors, and fragment ions and a natural language processing model, which has a function of peptide sequence quality assessment and helps with reconstruction of noisy sequences. The results of testing showed that the performance of PowerNovo is comparable and even better than widely utilized PointNovo, DeepNovo, Casanovo, and Novor packages. Also, PowerNovo provides complete cycle of processing (pipeline) of mass spectrometry data and, along with predicting the peptide sequence, involves the peptide assembly and protein inference blocks.
Collapse
|
4
|
Peters-Clarke TM, Coon JJ, Riley NM. Instrumentation at the Leading Edge of Proteomics. Anal Chem 2024; 96:7976-8010. [PMID: 38738990 PMCID: PMC11996003 DOI: 10.1021/acs.analchem.3c04497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2024]
Affiliation(s)
- Trenton M. Peters-Clarke
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Joshua J. Coon
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, USA
- Morgridge Institute for Research, Madison, WI, USA
| | | |
Collapse
|
5
|
Korchak JA, Jeffery ED, Bandyopadhyay S, Jordan BT, Lehe M, Watts EF, Fenix A, Wilhelm M, Sheynkman GM. IS-PRM-based peptide targeting informed by long-read sequencing for alternative proteome detection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.01.587549. [PMID: 38617311 PMCID: PMC11014528 DOI: 10.1101/2024.04.01.587549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Alternative splicing is a major contributor of transcriptomic complexity, but the extent to which transcript isoforms are translated into stable, functional protein isoforms is unclear. Furthermore, detection of relatively scarce isoform-specific peptides is challenging, with many protein isoforms remaining uncharted due to technical limitations. Recently, a family of advanced targeted MS strategies, termed internal standard parallel reaction monitoring (IS-PRM), have demonstrated multiplexed, sensitive detection of pre-defined peptides of interest. Such approaches have not yet been used to confirm existence of novel peptides. Here, we present a targeted proteogenomic approach that leverages sample-matched long-read RNA sequencing (LR RNAseq) data to predict potential protein isoforms with prior transcript evidence. Predicted tryptic isoform-specific peptides, which are specific to individual gene product isoforms, serve as "triggers" and "targets" in the IS-PRM method, Tomahto. Using the model human stem cell line WTC11, LR RNAseq data were generated and used to inform the generation of synthetic standards for 192 isoform-specific peptides (114 isoforms from 55 genes). These synthetic "trigger" peptides were labeled with super heavy tandem mass tags (TMT) and spiked into TMT-labeled WTC11 tryptic digest, predicted to contain corresponding endogenous "target" peptides. Compared to DDA mode, Tomahto increased detectability of isoforms by 3.6-fold, resulting in the identification of five previously unannotated isoforms. Our method detected protein isoform expression for 43 out of 55 genes corresponding to 54 resolved isoforms. This LR RNA seq-informed Tomahto targeted approach, called LRP-IS-PRM, is a new modality for generating protein-level evidence of alternative isoforms - a critical first step in designing functional studies and eventually clinical assays.
Collapse
Affiliation(s)
- Jennifer A. Korchak
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Erin D. Jeffery
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Saikat Bandyopadhyay
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Ben T. Jordan
- Cancer Genomics Research Laboratory, Frederick National Laboratory for Cancer Research, Frederick, MD USA
| | - Micah Lehe
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Emily F. Watts
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
| | - Aidan Fenix
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
| | - Mathias Wilhelm
- Computational Mass Spectrometry, Technical University of Munich (TUM), D-85354 Freising, Germany
| | - Gloria M. Sheynkman
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
- UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
6
|
Lou R, Shui W. Acquisition and Analysis of DIA-Based Proteomic Data: A Comprehensive Survey in 2023. Mol Cell Proteomics 2024; 23:100712. [PMID: 38182042 PMCID: PMC10847697 DOI: 10.1016/j.mcpro.2024.100712] [Citation(s) in RCA: 29] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/27/2023] [Accepted: 01/02/2024] [Indexed: 01/07/2024] Open
Abstract
Data-independent acquisition (DIA) mass spectrometry (MS) has emerged as a powerful technology for high-throughput, accurate, and reproducible quantitative proteomics. This review provides a comprehensive overview of recent advances in both the experimental and computational methods for DIA proteomics, from data acquisition schemes to analysis strategies and software tools. DIA acquisition schemes are categorized based on the design of precursor isolation windows, highlighting wide-window, overlapping-window, narrow-window, scanning quadrupole-based, and parallel accumulation-serial fragmentation-enhanced DIA methods. For DIA data analysis, major strategies are classified into spectrum reconstruction, sequence-based search, library-based search, de novo sequencing, and sequencing-independent approaches. A wide array of software tools implementing these strategies are reviewed, with details on their overall workflows and scoring approaches at different steps. The generation and optimization of spectral libraries, which are critical resources for DIA analysis, are also discussed. Publicly available benchmark datasets covering global proteomics and phosphoproteomics are summarized to facilitate performance evaluation of various software tools and analysis workflows. Continued advances and synergistic developments of versatile components in DIA workflows are expected to further enhance the power of DIA-based proteomics.
Collapse
Affiliation(s)
- Ronghui Lou
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| | - Wenqing Shui
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
7
|
Cadang L, Tam CYJ, Moore BN, Fichtl J, Yang F. A Highly Efficient Workflow for Detecting and Identifying Sequence Variants in Therapeutic Proteins with a High Resolution LC-MS/MS Method. Molecules 2023; 28:molecules28083392. [PMID: 37110623 PMCID: PMC10144261 DOI: 10.3390/molecules28083392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 04/03/2023] [Accepted: 04/10/2023] [Indexed: 04/29/2023] Open
Abstract
Large molecule protein therapeutics have steadily grown and now represent a significant portion of the overall pharmaceutical market. These complex therapies are commonly manufactured using cell culture technology. Sequence variants (SVs) are undesired minor variants that may arise from the cell culture biomanufacturing process that can potentially affect the safety and efficacy of a protein therapeutic. SVs have unintended amino acid substitutions and can come from genetic mutations or translation errors. These SVs can either be detected using genetic screening methods or by mass spectrometry (MS). Recent advances in Next-generation Sequencing (NGS) technology have made genetic testing cheaper, faster, and more convenient compared to time-consuming low-resolution tandem MS and Mascot Error Tolerant Search (ETS)-based workflows which often require ~6 to 8 weeks data turnaround time. However, NGS still cannot detect non-genetic derived SVs while MS analysis can do both. Here, we report a highly efficient Sequence Variant Analysis (SVA) workflow using high-resolution MS and tandem mass spectrometry combined with improved software to greatly reduce the time and resource cost associated with MS SVA workflows. Method development was performed to optimize the high-resolution tandem MS and software score cutoff for both SV identification and quantitation. We discovered that a feature of the Fusion Lumos caused significant relative under-quantitation of low-level peptides and turned it off. A comparison of common Orbitrap platforms showed that similar quantitation values were obtained on a spiked-in sample. With this new workflow, the amount of false positive SVs was decreased by up to 93%, and SVA turnaround time by LC-MS/MS was shortened to 2 weeks, comparable to NGS analysis speed and making LC-MS/MS the top choice for SVA workflow.
Collapse
Affiliation(s)
- Lance Cadang
- Pharma Technical Development, Genentech, South San Francisco, CA 94080, USA
| | - Chi Yan Janet Tam
- Pharma Technical Development, Genentech, South San Francisco, CA 94080, USA
| | | | - Juergen Fichtl
- Pharma Technical Development, Roche Diagnostics GmbH, 82377 Penzberg, Germany
| | - Feng Yang
- Pharma Technical Development, Genentech, South San Francisco, CA 94080, USA
| |
Collapse
|
8
|
Silzel J, Julian RR. RDD-HCD Provides Variable Fragmentation Routes Dictated by Radical Stability. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2023; 34:452-458. [PMID: 36787650 PMCID: PMC9982999 DOI: 10.1021/jasms.2c00326] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 01/24/2023] [Accepted: 02/03/2023] [Indexed: 06/18/2023]
Abstract
Radical-directed dissociation (RDD) is a fragmentation technique in which a radical created by selective 213/266 nm photodissociation of a carbon-iodine bond is reisolated and collisionally activated. In previous RDD experiments, collisional activation was effected by ion-trap collision-induced dissociation (CID). Higher-energy collisional dissociation (HCD) differs from CID both in terms of how ions are excited and in the number, type, or abundance of fragments that are observed. In this paper, we explore the use of HCD for activation in RDD experiments. While RDD-CID favors fragments produced from radical-directed pathways such as a/z-ions and side chain losses regardless of the activation energy employed, RDD-HCD spectra vary considerably as a function of activation energy, with lower energies favoring RDD while higher energies favor products resulting from cleavage directed by mobile protons (b/y-ions). RDD-HCD therefore affords more tunable fragmentation based on the HCD energy provided. Importantly, the abundance of radical products decreases as a function of increasing HCD energy, confirming that RDD generally proceeds via lower-energy barriers relative to mobile-proton-driven dissociation. The dominance of b/y-ions at higher energies for RDD-HCD can therefore be explained by the higher survivability of fragments not containing the radical after the initial or subsequent dissociation events. Furthermore, these results confirm previous suspicions that HCD spectra differ from CID spectra due to multiple dissociation events.
Collapse
|
9
|
Abstract
Spectrum library searching is a powerful alternative to database searching for data dependent acquisition experiments, but has been historically limited to identifying previously observed peptides in libraries. Here we present Scribe, a new library search engine designed to leverage deep learning fragmentation prediction software such as Prosit. Rather than relying on highly curated DDA libraries, this approach predicts fragmentation and retention times for every peptide in a FASTA database. Scribe embeds Percolator for false discovery rate correction and an interference tolerant, label-free quantification integrator for an end-to-end proteomics workflow. By leveraging expected relative fragmentation and retention time values, we find that library searching with Scribe can outperform traditional database searching tools both in terms of sensitivity and quantitative precision. Scribe and its graphical interface are easy to use, freely accessible, and fully open source.
Collapse
Affiliation(s)
- Brian C Searle
- Department of Biomedical Informatics, The Ohio State University Medical Center, Columbus, Ohio43210, United States
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio43210, United States
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio43210, United States
- Proteome Software Inc., Portland, Oregon97219, United States
| | - Ariana E Shannon
- Department of Biomedical Informatics, The Ohio State University Medical Center, Columbus, Ohio43210, United States
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio43210, United States
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio43210, United States
| | - Damien Beau Wilburn
- Department of Biomedical Informatics, The Ohio State University Medical Center, Columbus, Ohio43210, United States
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio43210, United States
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio43210, United States
| |
Collapse
|
10
|
Cho BG, Gutierrez Reyes CD, Goli M, Gautam S, Banazadeh A, Mechref Y. Targeted N-Glycan Analysis with Parallel Reaction Monitoring Using a Quadrupole-Orbitrap Hybrid Mass Spectrometer. Anal Chem 2022; 94:15215-15222. [DOI: 10.1021/acs.analchem.2c01975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Byeong Gwan Cho
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409-1061, United States
| | | | - Mona Goli
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409-1061, United States
| | - Sakshi Gautam
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409-1061, United States
| | - Alireza Banazadeh
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409-1061, United States
| | - Yehia Mechref
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, Texas 79409-1061, United States
| |
Collapse
|
11
|
Abdullah AM, Sommers C, Hawes J, Rodriguez JD, Yang K. Tandem mass spectrometric sequence characterization of synthetic thymidine-rich oligonucleotides. JOURNAL OF MASS SPECTROMETRY : JMS 2022; 57:e4819. [PMID: 35347805 PMCID: PMC9287059 DOI: 10.1002/jms.4819] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Revised: 02/25/2022] [Accepted: 03/04/2022] [Indexed: 05/30/2023]
Abstract
Tandem mass spectrometry (MS/MS) can provide direct and accurate sequence characterization of synthetic oligonucleotide drugs, including modified oligonucleotides. Multiple factors can affect oligonucleotide MS/MS sequencing, including the intrinsic properties of oligonucleotides (i.e., nucleotide composition and structural modifications) and instrument parameters associated with the ion activation for fragmentation. In this study, MS/MS sequencing of a thymidine (T)-rich and phosphorothioate (PS)-modified DNA oligonucleotide was investigated using two fragmentation techniques: trap-type collision-induced dissociation ("CID") and beam-type CID also termed as higher-energy collisional dissociation ("HCD"), preceded by a hydrophilic interaction liquid chromatography (HILIC) separation. A low to moderate charge state (-4), which predominated under the optimized HILIC-MS conditions, was selected as the precursor ion for MS/MS analysis. Comparison of the two distinctive ion activation mechanisms on the same precursor demonstrated that HCD was superior to CID in promoting higher sequence coverage and analytical sensitivity in sequence elucidation of T-rich DNA oligonucleotides. Specifically, HCD provided more sequence-defining fragments with higher fragment intensities than CID. Furthermore, the direct comparison between unmodified and PS-modified DNA oligonucleotides demonstrated a loss of MS/MS fragmentation efficiency by PS modification in both CID and HCD approaches, and a resultant reduction in sequence coverage. The deficiency in PS DNA sequence coverage observed with single collision energy HCD, however, was partially recovered by applying HCD with multiple collision energies. Collectively, this work demonstrated that HCD is advantageous to MS/MS sequencing of T-rich PS-modified DNA oligonucleotides.
Collapse
Affiliation(s)
- A. M. Abdullah
- Division of Complex Drug Analysis, Office of Testing and Research, Office of Pharmaceutical Quality, Center for Drug Evaluation and ResearchU.S. Food and Drug AdministrationSaint LouisMissouriUSA
| | - Cynthia Sommers
- Division of Complex Drug Analysis, Office of Testing and Research, Office of Pharmaceutical Quality, Center for Drug Evaluation and ResearchU.S. Food and Drug AdministrationSaint LouisMissouriUSA
| | - Jessica Hawes
- Division of Systems Biology, National Center for Toxicological ResearchU.S. Food and Drug AdministrationJeffersonArkansasUSA
| | - Jason D. Rodriguez
- Division of Complex Drug Analysis, Office of Testing and Research, Office of Pharmaceutical Quality, Center for Drug Evaluation and ResearchU.S. Food and Drug AdministrationSaint LouisMissouriUSA
| | - Kui Yang
- Division of Complex Drug Analysis, Office of Testing and Research, Office of Pharmaceutical Quality, Center for Drug Evaluation and ResearchU.S. Food and Drug AdministrationSaint LouisMissouriUSA
| |
Collapse
|
12
|
Richards AL, Chen KH, Wilburn DB, Stevenson E, Polacco BJ, Searle BC, Swaney DL. Data-Independent Acquisition Protease-Multiplexing Enables Increased Proteome Sequence Coverage Across Multiple Fragmentation Modes. J Proteome Res 2022; 21:1124-1136. [PMID: 35234472 PMCID: PMC9035370 DOI: 10.1021/acs.jproteome.1c00960] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The use of multiple proteases has been shown to increase protein sequence coverage in proteomics experiments; however, due to the additional analysis time required, it has not been widely adopted in routine data-dependent acquisition (DDA) proteomic workflows. Alternatively, data-independent acquisition (DIA) has the potential to analyze multiplexed samples from different protease digests, but has been primarily optimized for fragmenting tryptic peptides. Here we evaluate a DIA multiplexing approach that combines three proteolytic digests (Trypsin, AspN, and GluC) into a single sample. We first optimize data acquisition conditions for each protease individually with both the canonical DIA fragmentation mode (beam type CID), as well as resonance excitation CID, to determine optimal consensus conditions across proteases. Next, we demonstrate that application of these conditions to a protease-multiplexed sample of human peptides results in similar protein identifications and quantitative performance as compared to trypsin alone, but enables up to a 63% increase in peptide detections, and a 45% increase in nonredundant amino acid detections. Nontryptic peptides enabled noncanonical protein isoform determination and resulted in 100% sequence coverage for numerous proteins, suggesting the utility of this approach in applications where sequence coverage is critical, such as protein isoform analysis.
Collapse
Affiliation(s)
- Alicia L Richards
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Kuei-Ho Chen
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Damien B Wilburn
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States.,Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States.,Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Erica Stevenson
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Benjamin J Polacco
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| | - Brian C Searle
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States.,Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States
| | - Danielle L Swaney
- Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California 94158, United States.,J. David Gladstone Institutes, San Francisco, California 94158, United States.,Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California 94158, United States
| |
Collapse
|