1
|
Habeck T, Brown KA, Des Soye B, Lantz C, Zhou M, Alam N, Hossain MA, Jung W, Keener JE, Volny M, Wilson JW, Ying Y, Agar JN, Danis PO, Ge Y, Kelleher NL, Li H, Loo JA, Marty MT, Paša-Tolić L, Sandoval W, Lermyte F. Top-down mass spectrometry of native proteoforms and their complexes: a community study. Nat Methods 2024:10.1038/s41592-024-02279-6. [PMID: 38744918 DOI: 10.1038/s41592-024-02279-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 04/10/2024] [Indexed: 05/16/2024]
Abstract
The combination of native electrospray ionization with top-down fragmentation in mass spectrometry (MS) allows simultaneous determination of the stoichiometry of noncovalent complexes and identification of their component proteoforms and cofactors. Although this approach is powerful, both native MS and top-down MS are not yet well standardized, and only a limited number of laboratories regularly carry out this type of research. To address this challenge, the Consortium for Top-Down Proteomics initiated a study to develop and test protocols for native MS combined with top-down fragmentation of proteins and protein complexes across 11 instruments in nine laboratories. Here we report the summary of the outcomes to provide robust benchmarks and a valuable entry point for the scientific community.
Collapse
Affiliation(s)
- Tanja Habeck
- Technische Universität Darmstadt, Darmstadt, Germany
| | - Kyle A Brown
- University of Wisconsin-Madison, Madison, WI, USA
| | | | | | - Mowei Zhou
- Pacific Northwest National Laboratory, Richland, WA, USA
- Zhejiang University, Zhejiang, China
| | | | | | | | | | | | - Jesse W Wilson
- Pacific Northwest National Laboratory, Richland, WA, USA
| | - Yujia Ying
- Sun Yat-sen University, Guangzhou, China
| | - Jeffrey N Agar
- Northeastern University, Boston, MA, USA
- Consortium for Top-Down Proteomics, Cambridge, MA, USA
| | - Paul O Danis
- Consortium for Top-Down Proteomics, Cambridge, MA, USA
| | - Ying Ge
- University of Wisconsin-Madison, Madison, WI, USA
- Consortium for Top-Down Proteomics, Cambridge, MA, USA
| | - Neil L Kelleher
- Northwestern University, Evanston, IL, USA
- Consortium for Top-Down Proteomics, Cambridge, MA, USA
| | - Huilin Li
- Sun Yat-sen University, Guangzhou, China
| | - Joseph A Loo
- University of California, Los Angeles, CA, USA
- Consortium for Top-Down Proteomics, Cambridge, MA, USA
| | | | - Ljiljana Paša-Tolić
- Pacific Northwest National Laboratory, Richland, WA, USA
- Consortium for Top-Down Proteomics, Cambridge, MA, USA
| | | | | |
Collapse
|
2
|
Jeong K, Kaulich PT, Jung W, Kim J, Tholey A, Kohlbacher O. Precursor deconvolution error estimation: The missing puzzle piece in false discovery rate in top-down proteomics. Proteomics 2024; 24:e2300068. [PMID: 37997224 DOI: 10.1002/pmic.202300068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 11/09/2023] [Accepted: 11/13/2023] [Indexed: 11/25/2023]
Abstract
Top-down proteomics (TDP) directly analyzes intact proteins and thus provides more comprehensive qualitative and quantitative proteoform-level information than conventional bottom-up proteomics (BUP) that relies on digested peptides and protein inference. While significant advancements have been made in TDP in sample preparation, separation, instrumentation, and data analysis, reliable and reproducible data analysis still remains one of the major bottlenecks in TDP. A key step for robust data analysis is the establishment of an objective estimation of proteoform-level false discovery rate (FDR) in proteoform identification. The most widely used FDR estimation scheme is based on the target-decoy approach (TDA), which has primarily been established for BUP. We present evidence that the TDA-based FDR estimation may not work at the proteoform-level due to an overlooked factor, namely the erroneous deconvolution of precursor masses, which leads to incorrect FDR estimation. We argue that the conventional TDA-based FDR in proteoform identification is in fact protein-level FDR rather than proteoform-level FDR unless precursor deconvolution error rate is taken into account. To address this issue, we propose a formula to correct for proteoform-level FDR bias by combining TDA-based FDR and precursor deconvolution error rate.
Collapse
Affiliation(s)
- Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Wonhyeuk Jung
- Department of Cell Biology, Yale School of Medicine, New Haven, Connecticut, USA
| | - Jihyung Kim
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
- Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany
| |
Collapse
|
3
|
Habeck T, Maciel EVS, Kretschmer K, Lermyte F. Charge site manipulation to enhance top-down fragmentation efficiency. Proteomics 2024; 24:e2300082. [PMID: 37043727 DOI: 10.1002/pmic.202300082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 04/05/2023] [Accepted: 04/06/2023] [Indexed: 04/14/2023]
Abstract
In recent years, top-down mass spectrometry has become a widely used approach to study proteoforms; however, improving sequence coverage remains an important goal. Here, two different proteins, α-synuclein and bovine carbonic anhydrase, were subjected to top-down collision-induced dissociation (CID) after electrospray ionisation. Two high-boiling solvents, DMSO and propylene carbonate, were added to the protein solution in low concentration (2%) and the effects on the top-down fragmentation patterns of the proteins were systematically investigated. Each sample was measured in triplicate, which revealed highly reproducible differences in the top-down CID fragmentation patterns in the presence of a solution additive, even if the same precursor charge state was isolated in the quadrupole of the instrument. Further investigation supports the solution condition-dependent selective formation of different protonation site isomers as the underlying cause of these differences. Higher sequence coverage was often observed in the presence of additives, and the benefits of this approach became even more evident when datasets from different solution conditions were combined, as increases up to 35% in cleavage coverage were obtained. Overall, this approach therefore represents a promising opportunity to increase top-down fragmentation efficiency.
Collapse
Affiliation(s)
- Tanja Habeck
- Department of Chemistry, Clemens-Schöpf-Institute of Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Hessen, Germany
| | - Edvaldo Vasconcelos Soares Maciel
- Department of Chemistry, Clemens-Schöpf-Institute of Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Hessen, Germany
| | - Kevin Kretschmer
- Department of Chemistry, Clemens-Schöpf-Institute of Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Hessen, Germany
| | - Frederik Lermyte
- Department of Chemistry, Clemens-Schöpf-Institute of Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Hessen, Germany
| |
Collapse
|
4
|
Lermyte F, Habeck T, Brown K, Des Soye B, Lantz C, Zhou M, Alam N, Hossain MA, Jung W, Keener J, Volny M, Wilson J, Ying Y, Agar J, Danis P, Ge Y, Kelleher N, Li H, Loo J, Marty M, Pasa-Tolic L, Sandoval W. Top-down mass spectrometry of native proteoforms and their complexes: A community study. RESEARCH SQUARE 2023:rs.3.rs-3228472. [PMID: 37674709 PMCID: PMC10479449 DOI: 10.21203/rs.3.rs-3228472/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
The combination of native electrospray ionisation with top-down fragmentation in mass spectrometry allows simultaneous determination of the stoichiometry of noncovalent complexes and identification of their component proteoforms and co-factors. While this approach is powerful, both native mass spectrometry and top-down mass spectrometry are not yet well standardised, and only a limited number of laboratories regularly carry out this type of research. To address this challenge, the Consortium for Top-Down Proteomics (CTDP) initiated a study to develop and test protocols for native mass spectrometry combined with top-down fragmentation of proteins and protein complexes across eleven instruments in nine laboratories. The outcomes are summarised in this report to provide robust benchmarks and a valuable entry point for the scientific community.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | - Jeffrey Agar
- Department of Chemistry and Chemical Biology, Northeastern University
| | | | - Ying Ge
- University of Wisconsin-Madison
| | | | | | | | | | | | | |
Collapse
|
5
|
Larson EJ, Pergande MR, Moss ME, Rossler KJ, Wenger RK, Krichel B, Josyer H, Melby JA, Roberts DS, Pike K, Shi Z, Chan HJ, Knight B, Rogers HT, Brown KA, Ong IM, Jeong K, Marty MT, McIlwain SJ, Ge Y. MASH Native: a unified solution for native top-down proteomics data processing. Bioinformatics 2023; 39:btad359. [PMID: 37294807 PMCID: PMC10283151 DOI: 10.1093/bioinformatics/btad359] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 04/13/2023] [Accepted: 06/07/2023] [Indexed: 06/11/2023] Open
Abstract
MOTIVATION Native top-down proteomics (nTDP) integrates native mass spectrometry (nMS) with top-down proteomics (TDP) to provide comprehensive analysis of protein complexes together with proteoform identification and characterization. Despite significant advances in nMS and TDP software developments, a unified and user-friendly software package for analysis of nTDP data remains lacking. RESULTS We have developed MASH Native to provide a unified solution for nTDP to process complex datasets with database searching capabilities in a user-friendly interface. MASH Native supports various data formats and incorporates multiple options for deconvolution, database searching, and spectral summing to provide a "one-stop shop" for characterizing both native protein complexes and proteoforms. AVAILABILITY AND IMPLEMENTATION The MASH Native app, video tutorials, written tutorials, and additional documentation are freely available for download at https://labs.wisc.edu/gelab/MASH_Explorer/MASHSoftware.php. All data files shown in user tutorials are included with the MASH Native software in the download .zip file.
Collapse
Affiliation(s)
- Eli J Larson
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Melissa R Pergande
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Michelle E Moss
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Kalina J Rossler
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - R Kent Wenger
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
- Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Boris Krichel
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Harini Josyer
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Jake A Melby
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - David S Roberts
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Kyndalanne Pike
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Zhuoxin Shi
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Hsin-Ju Chan
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Bridget Knight
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Holden T Rogers
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Kyle A Brown
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Irene M Ong
- Department of Biostatistics and Medical Informatics, University of Wisconsin–Madison, Madison, WI 53705, United States
- University of Wisconsin Carbone Cancer Center, University of Wisconsin-Madison, Madison, WI 53705, United States
- Department of Obstetrics and Gynecology, University of Wisconsin–Madison, Madison, WI 53705, United States
| | - Kyowon Jeong
- Department of Applied Bioinformatics, University of Tübingen, Tübingen 72704, Germany
| | - Michael T Marty
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85719, United States
| | - Sean J McIlwain
- Department of Biostatistics and Medical Informatics, University of Wisconsin–Madison, Madison, WI 53705, United States
- University of Wisconsin Carbone Cancer Center, University of Wisconsin-Madison, Madison, WI 53705, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin–Madison, Madison, WI 53705, United States
- Department of Cell and Regenerative Biology, University of Wisconsin–Madison, Madison, WI 53705, United States
- Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin–Madison, Madison, WI 53705, United States
| |
Collapse
|
6
|
Varshney N, Mishra AK. Deep Learning in Phosphoproteomics: Methods and Application in Cancer Drug Discovery. Proteomes 2023; 11:proteomes11020016. [PMID: 37218921 DOI: 10.3390/proteomes11020016] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 04/24/2023] [Accepted: 04/25/2023] [Indexed: 05/24/2023] Open
Abstract
Protein phosphorylation is a key post-translational modification (PTM) that is a central regulatory mechanism of many cellular signaling pathways. Several protein kinases and phosphatases precisely control this biochemical process. Defects in the functions of these proteins have been implicated in many diseases, including cancer. Mass spectrometry (MS)-based analysis of biological samples provides in-depth coverage of phosphoproteome. A large amount of MS data available in public repositories has unveiled big data in the field of phosphoproteomics. To address the challenges associated with handling large data and expanding confidence in phosphorylation site prediction, the development of many computational algorithms and machine learning-based approaches have gained momentum in recent years. Together, the emergence of experimental methods with high resolution and sensitivity and data mining algorithms has provided robust analytical platforms for quantitative proteomics. In this review, we compile a comprehensive collection of bioinformatic resources used for the prediction of phosphorylation sites, and their potential therapeutic applications in the context of cancer.
Collapse
Affiliation(s)
- Neha Varshney
- Division of Biological Sciences, Department of Cellular and Molecular Medicine, University of California, San Diego, CA 93093, USA
- Ludwig Institute for Cancer Research, La Jolla, CA 92093, USA
| | - Abhinava K Mishra
- Molecular, Cellular and Developmental Biology Department, University of California, Santa Barbara, CA 93106, USA
| |
Collapse
|
7
|
Dunham SD, Wei B, Lantz C, Loo JA, Brodbelt JS. Impact of Internal Fragments on Top-Down Analysis of Intact Proteins by 193 nm UVPD. J Proteome Res 2023; 22:170-181. [PMID: 36503236 DOI: 10.1021/acs.jproteome.2c00583] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
193 nm ultraviolet photodissociation (UVPD) allows high sequence coverage to be obtained for intact proteins using terminal fragments alone. However, internal fragments, those that contain neither N- nor C- terminus, are typically ignored, neglecting their potential to bolster characterization of intact proteins. Here, we explore internal fragments generated by 193 nm UVPD for proteins ranging in size from 17-47 kDa and using the ClipsMS algorithm to facilitate searches for internal fragments. Internal fragments were only retained if identified in multiple replicates in order to reduce spurious assignments and to explore the reproducibility of internal fragments generated by UVPD. Inclusion of internal fragment improved sequence coverage by an average of 18% and 32% for UVPD and HCD, respectively, across all proteins and charge states studied. However, only an average of 18% of UVPD internal fragments were identified in two out of three replicates relative to the average number identified across all replicates for all proteins studied. Conversely, for HCD, an average of 63% of internal fragments were retained across replicates. These trends reflect an increased risk of false-positive identifications and a need for caution when considering internal fragments for UVPD. Additionally, proton-transfer charge reduction (PTCR) reactions were performed following UVPD or HCD to assess the impact on internal fragment identifications, allowing up to 20% more fragment ions to be retained across multiple replicates. At this time, it is difficult to recommend the inclusion of the internal fragment when searching UVPD spectra without further work to develop strategies for reducing the possibilities of false-positive identifications. All mass spectra are available in the public repository jPOST with the accession number JPST001885.
Collapse
Affiliation(s)
- Sean D Dunham
- Department of Chemistry, University of Texas at Austin, Austin, Texas 78712, United States
| | - Benqian Wei
- Department of Chemistry and Biochemistry, University of California Los Angeles, Los Angeles, California 90095, United States
| | - Carter Lantz
- Department of Chemistry and Biochemistry, University of California Los Angeles, Los Angeles, California 90095, United States
| | - Joseph A Loo
- Department of Chemistry and Biochemistry, University of California Los Angeles, Los Angeles, California 90095, United States
| | - Jennifer S Brodbelt
- Department of Chemistry, University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
8
|
Desaire H, Go EP, Hua D. Advances, obstacles, and opportunities for machine learning in proteomics. CELL REPORTS. PHYSICAL SCIENCE 2022; 3:101069. [PMID: 36381226 PMCID: PMC9648337 DOI: 10.1016/j.xcrp.2022.101069] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The fields of proteomics and machine learning are both large disciplines, each producing well over 5,000 publications per year. However, studies combining both fields are still relatively rare, with only about 2% of recent proteomics papers including machine learning. This review, which focuses on the intersection of the fields, is intended to inspire proteomics researchers to develop skills and knowledge in the application of machine learning. A brief tutorial introduction to machine learning is provided, and research advances that rely on both fields, particularly as they relate to proteomics tools development and biomarker discovery, are highlighted. Key knowledge gaps and opportunities for scientific advancement are also enumerated.
Collapse
Affiliation(s)
- Heather Desaire
- Department of Chemistry, University of Kansas, Lawrence, KS 66045, USA
| | - Eden P. Go
- Department of Chemistry, University of Kansas, Lawrence, KS 66045, USA
| | - David Hua
- Department of Chemistry, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
9
|
Pauwels J, Fijałkowska D, Eyckerman S, Gevaert K. Mass spectrometry and the cellular surfaceome. MASS SPECTROMETRY REVIEWS 2022; 41:804-841. [PMID: 33655572 DOI: 10.1002/mas.21690] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Revised: 02/05/2021] [Accepted: 02/09/2021] [Indexed: 06/12/2023]
Abstract
The collection of exposed plasma membrane proteins, collectively termed the surfaceome, is involved in multiple vital cellular processes, such as the communication of cells with their surroundings and the regulation of transport across the lipid bilayer. The surfaceome also plays key roles in the immune system by recognizing and presenting antigens, with its possible malfunctioning linked to disease. Surface proteins have long been explored as potential cell markers, disease biomarkers, and therapeutic drug targets. Despite its importance, a detailed study of the surfaceome continues to pose major challenges for mass spectrometry-driven proteomics due to the inherent biophysical characteristics of surface proteins. Their inefficient extraction from hydrophobic membranes to an aqueous medium and their lower abundance compared to intracellular proteins hamper the analysis of surface proteins, which are therefore usually underrepresented in proteomic datasets. To tackle such problems, several innovative analytical methodologies have been developed. This review aims at providing an extensive overview of the different methods for surfaceome analysis, with respective considerations for downstream mass spectrometry-based proteomics.
Collapse
Affiliation(s)
- Jarne Pauwels
- VIB Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | | | - Sven Eyckerman
- VIB Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Kris Gevaert
- VIB Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| |
Collapse
|
10
|
Dunham SD, Sanders JD, Holden DD, Brodbelt JS. Improving the Center Section Sequence Coverage of Large Proteins Using Stepped-Fragment Ion Protection Ultraviolet Photodissociation. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2022; 33:446-456. [PMID: 35119856 DOI: 10.1021/jasms.1c00296] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Ultraviolet photodissociation (UVPD) mass spectrometry has gained attention in recent years for its ability to provide high sequence coverage of intact proteins. However, secondary dissociation of fragment ions, in which fragment ions subjected to multiple laser pulses decompose into small products, is a common phenomenon during UVPD that contributes to limited coverage in the midsection of protein sequences. To counter secondary dissociation, a method involving the application of notched waveforms to modulate the trajectories of fragment ions away from the laser beam, termed fragment ion protection (FIP), was previously developed to reduce the probability of secondary dissociation. This, in turn, increased the number of identified large fragment ions. In the present study, FIP was applied to UVPD of large proteins ranging in size from 29 to 55 kDa, enhancing the abundances of large fragment ions. A stepped-FIP strategy was implemented in which UVPD mass spectra were collected using multiple different amplitudes of the FIP waveforms and then the results from the mass spectra were combined. By using stepped-FIP, the number of fragment ions in the midsections of the sequences increased for all proteins. For example, whereas no fragment ions were identified in the middle section of the sequence for glutamate dehydrogenase (55 kDa, 55+ charge state), 10 sequence ions were identified by using UVPD-FIP.
Collapse
Affiliation(s)
- Sean D Dunham
- Department of Chemistry, University of Texas, Austin, Texas 78712, United States
| | - James D Sanders
- Department of Chemistry, University of Texas, Austin, Texas 78712, United States
| | - Dustin D Holden
- Department of Chemistry, University of Texas, Austin, Texas 78712, United States
| | - Jennifer S Brodbelt
- Department of Chemistry, University of Texas, Austin, Texas 78712, United States
| |
Collapse
|
11
|
Detection of Fusobacterium nucleatum subspecies in the saliva of pre-colorectal cancer patients, using tandem mass spectrometry. Arch Oral Biol 2021; 134:105337. [PMID: 34929558 DOI: 10.1016/j.archoralbio.2021.105337] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 11/25/2021] [Accepted: 12/06/2021] [Indexed: 12/12/2022]
Abstract
OBJECTIVE Rising evidence links Fusobacterium nucleatum (F. nucleatum) with its four subspecies; nucleatum, polymorphum, animalis, and vincentii, with the development of colorectal cancer (CRC) and its precursor colorectal adenoma (CRA). This study aims to optimize a technique for and explore the capability of matrix-assisted laser-desorption ionization-tandem time-of-flight mass spectrometry (MALDI-TOF/TOF MS) to detect F. nucleatum subspecies directly from the saliva samples of CRA patients and controls without preculturing. DESIGN Saliva samples were collected from four CRA patients and eight controls. Proteins were extracted and subjected to solid-phase extraction fractionation, enzymatically digested, and analyzed by MALDI-TOF/TOF MS. F. nucleatum subspecies strains were cultured and used as a positive control. RESULTS A proteomics approach was developed to identify F. nucleatum subspecies directly from saliva samples. With this approach, the bacterial culturing step, which could take up to seven days, was bypassed. Overall, 157 F. nucleatum subspecies proteins were detected in the saliva samples. F. nucleatum subsp. nucleatum was absent in the patients while detected in half of the controls. CONCLUSION This study presents a novel technique for detecting F. nucleatum subspecies from saliva specimens that could later be employed to better understand a potential role of those subspecies in CRC development.
Collapse
|
12
|
Roberts DS, Mann M, Melby JA, Larson EJ, Zhu Y, Brasier AR, Jin S, Ge Y. Structural O-Glycoform Heterogeneity of the SARS-CoV-2 Spike Protein Receptor-Binding Domain Revealed by Top-Down Mass Spectrometry. J Am Chem Soc 2021; 143:12014-12024. [PMID: 34328324 PMCID: PMC8353889 DOI: 10.1021/jacs.1c02713] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) utilizes an extensively glycosylated surface spike (S) protein to mediate host cell entry, and the S protein glycosylation plays key roles in altering the viral binding/function and infectivity. However, the molecular structures and glycan heterogeneity of the new O-glycans found on the S protein regional-binding domain (S-RBD) remain cryptic because of the challenges in intact glycoform analysis by conventional bottom-up glycoproteomic approaches. Here, we report the complete structural elucidation of intact O-glycan proteoforms through a hybrid native and denaturing top-down mass spectrometry (MS) approach employing both trapped ion mobility spectrometry (TIMS) quadrupole time-of-flight and ultrahigh-resolution Fourier transform ion cyclotron resonance (FTICR)-MS. Native top-down TIMS-MS/MS separates the protein conformers of the S-RBD to reveal their gas-phase structural heterogeneity, and top-down FTICR-MS/MS provides in-depth glycoform analysis for unambiguous identification of the glycan structures and their glycosites. A total of eight O-glycoforms and their relative molecular abundance are structurally elucidated for the first time. These findings demonstrate that this hybrid top-down MS approach can provide a high-resolution proteoform-resolved mapping of diverse O-glycoforms of the S glycoprotein, which lays a strong molecular foundation to uncover the functional roles of their O-glycans. This proteoform-resolved approach can be applied to reveal the structural O-glycoform heterogeneity of emergent SARS-CoV-2 S-RBD variants as well as other O-glycoproteins in general.
Collapse
Affiliation(s)
- David S Roberts
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Morgan Mann
- Department of Medicine, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Jake A Melby
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Eli J Larson
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Yanlong Zhu
- Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Allan R Brasier
- Department of Medicine, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Institute for Clinical and Translational Research, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Song Jin
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States.,Department of Medicine, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| |
Collapse
|
13
|
Fang L, Wang X. COVID-19 deep classification network based on convolution and deconvolution local enhancement. Comput Biol Med 2021; 135:104588. [PMID: 34182330 PMCID: PMC8216864 DOI: 10.1016/j.compbiomed.2021.104588] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 05/22/2021] [Accepted: 06/16/2021] [Indexed: 12/11/2022]
Abstract
Computer Tomography (CT) detection can effectively overcome the problems of traditional detection of Corona Virus Disease 2019 (COVID-19), such as lagging detection results and wrong diagnosis results, which lead to the increase of disease infection rate and prevalence rate. The novel coronavirus pneumonia is a significant difference between the positive and negative patients with asymptomatic infections. To effectively improve the accuracy of doctors' manual judgment of positive and negative COVID-19, this paper proposes a deep classification network model of the novel coronavirus pneumonia based on convolution and deconvolution local enhancement. Through convolution and deconvolution operation, the contrast between the local lesion region and the abdominal cavity of COVID-19 is enhanced. Besides, the middle-level features that can effectively distinguish the image types are obtained. By transforming the novel coronavirus detection problem into the region of interest (ROI) feature classification problem, it can effectively determine whether the feature vector in each feature channel contains the image features of COVID-19. This paper uses an open-source COVID-CT dataset provided by Petuum researchers from the University of California, San Diego, which is collected from 143 novel coronavirus pneumonia patients and the corresponding features are preserved. The complete dataset (including original image and enhanced image) contains 1460 images. Among them, 1022 (70%) and 438 (30%) are used to train and test the performance of the proposed model, respectively. The proposed model verifies the classification precision in different convolution layers and learning rates. Besides, it is compared with most state-of-the-art models. It is found that the proposed algorithm has good classification performance. The corresponding sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and precision are 0.98, 0.96, 0.98, and 0.97, respectively.
Collapse
Affiliation(s)
- Lingling Fang
- Department of Computing and Information Technology, Liaoning Normal University, Dalian City, Liaoning Province, China.
| | - Xin Wang
- Department of Computing and Information Technology, Liaoning Normal University, Dalian City, Liaoning Province, China
| |
Collapse
|
14
|
Melby JA, Roberts DS, Larson EJ, Brown KA, Bayne EF, Jin S, Ge Y. Novel Strategies to Address the Challenges in Top-Down Proteomics. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:1278-1294. [PMID: 33983025 PMCID: PMC8310706 DOI: 10.1021/jasms.1c00099] [Citation(s) in RCA: 80] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Top-down mass spectrometry (MS)-based proteomics is a powerful technology for comprehensively characterizing proteoforms to decipher post-translational modifications (PTMs) together with genetic variations and alternative splicing isoforms toward a proteome-wide understanding of protein functions. In the past decade, top-down proteomics has experienced rapid growth benefiting from groundbreaking technological advances, which have begun to reveal the potential of top-down proteomics for understanding basic biological functions, unraveling disease mechanisms, and discovering new biomarkers. However, many challenges remain to be comprehensively addressed. In this Account & Perspective, we discuss the major challenges currently facing the top-down proteomics field, particularly in protein solubility, proteome dynamic range, proteome complexity, data analysis, proteoform-function relationship, and analytical throughput for precision medicine. We specifically review the major technology developments addressing these challenges with an emphasis on our research group's efforts, including the development of top-down MS-compatible surfactants for protein solubilization, functionalized nanoparticles for the enrichment of low-abundance proteoforms, strategies for multidimensional chromatography separation of proteins, and a new comprehensive user-friendly software package for top-down proteomics. We have also made efforts to connect proteoforms with biological functions and provide our visions on what the future holds for top-down proteomics.
Collapse
Affiliation(s)
- Jake A Melby
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - David S Roberts
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Eli J Larson
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Kyle A Brown
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
- Department of Surgery, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Elizabeth F Bayne
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Song Jin
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
- Human Proteomics Program, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| |
Collapse
|
15
|
Roberts DS, Mann MW, Melby JA, Larson EJ, Zhu Y, Brasier AR, Jin S, Ge Y. Structural O-Glycoform Heterogeneity of the SARS-CoV-2 Spike Protein Receptor-Binding Domain Revealed by Native Top-Down Mass Spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 33688648 DOI: 10.1101/2021.02.28.433291] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) utilizes an extensively glycosylated surface spike (S) protein to mediate host cell entry and the S protein glycosylation is strongly implicated in altering viral binding/function and infectivity. However, the structures and relative abundance of the new O-glycans found on the S protein regional-binding domain (S-RBD) remain cryptic because of the challenges in intact glycoform analysis. Here, we report the complete structural characterization of intact O-glycan proteoforms using native top-down mass spectrometry (MS). By combining trapped ion mobility spectrometry (TIMS), which can separate the protein conformers of S-RBD and analyze their gas phase structural variants, with ultrahigh-resolution Fourier transform ion cyclotron resonance (FTICR) MS analysis, the O-glycoforms of the S-RBD are comprehensively characterized, so that seven O-glycoforms and their relative molecular abundance are structurally elucidated for the first time. These findings demonstrate that native top-down MS can provide a high-resolution proteoform-resolved mapping of diverse O-glycoforms of the S glycoprotein, which lays a strong molecular foundation to uncover the functional roles of their O-glycans. This proteoform-resolved approach can be applied to reveal the structural O-glycoform heterogeneity of emergent SARS-CoV-2 S-RBD variants, as well as other O-glycoproteins in general.
Collapse
|
16
|
Brown KA, Melby JA, Roberts DS, Ge Y. Top-down proteomics: challenges, innovations, and applications in basic and clinical research. Expert Rev Proteomics 2020; 17:719-733. [PMID: 33232185 PMCID: PMC7864889 DOI: 10.1080/14789450.2020.1855982] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 11/23/2020] [Indexed: 12/14/2022]
Abstract
Introduction- A better understanding of the underlying molecular mechanism of diseases is critical for developing more effective diagnostic tools and therapeutics toward precision medicine. However, many challenges remain to unravel the complex nature of diseases. Areas covered- Changes in protein isoform expression and post-translation modifications (PTMs) have gained recognition for their role in underlying disease mechanisms. Top-down mass spectrometry (MS)-based proteomics is increasingly recognized as an important method for the comprehensive characterization of proteoforms that arise from alternative splicing events and/or PTMs for basic and clinical research. Here, we review the challenges, technological innovations, and recent studies that utilize top-down proteomics to elucidate changes in the proteome with an emphasis on its use to study heart diseases. Expert opinion- Proteoform-resolved information can substantially contribute to the understanding of the molecular mechanisms underlying various diseases and for the identification of novel proteoform targets for better therapeutic development . Despite the challenges of sequencing intact proteins, top-down proteomics has enabled a wealth of information regarding protein isoform switching and changes in PTMs. Continuous developments in sample preparation, intact protein separation, and instrumentation for top-down MS have broadened its capabilities to characterize proteoforms from a range of samples on an increasingly global scale.
Collapse
Affiliation(s)
- Kyle A. Brown
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
| | - Jake A. Melby
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
| | - David S. Roberts
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin, United States
- Human Proteomics Program, University of Wisconsin-Madison, Madison, Wisconsin, United States
| |
Collapse
|
17
|
Peris-Díaz MD, Guran R, Zitka O, Adam V, Krężel A. Mass Spectrometry-Based Structural Analysis of Cysteine-Rich Metal-Binding Sites in Proteins with MetaOdysseus R Software. J Proteome Res 2020; 20:776-785. [PMID: 32924499 PMCID: PMC7786378 DOI: 10.1021/acs.jproteome.0c00651] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
![]()
Identification
of metal-binding sites in proteins and understanding
metal-coupled protein folding mechanisms are aspects of high importance
for the structure-to-function relationship. Mass spectrometry (MS)
has brought a powerful adjunct perspective to structural biology,
obtaining from metal-to-protein stoichiometry to quaternary structure
information. Currently, the different experimental and/or instrumental
setups usually require the use of multiple data analysis software,
and in some cases, they lack some of the main data analysis steps
(MS processing, scoring, identification). Here, we present a comprehensive
data analysis pipeline that addresses charge-state deconvolution,
statistical scoring, and mass assignment for native MS, bottom-up,
and native top-down with emphasis on metal–protein complexes.
We have evaluated all of the approaches using assemblies of increasing
complexity, including free and chemically labeled proteins, from low-
to high-resolution MS. In all cases, the results have been compared
with common software and proved how MetaOdysseus outperformed them.
Collapse
Affiliation(s)
- Manuel David Peris-Díaz
- Department of Chemical Biology, Faculty of Biotechnology, University of Wrocław, F. Joliot-Curie 14a, 50-383 Wrocław, Poland
| | - Roman Guran
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00 Brno, Czech Republic.,Central European Institute of Technology, Brno University of Technology, Purkynova 123, 612 00 Brno, Czech Republic
| | - Ondrej Zitka
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00 Brno, Czech Republic.,Central European Institute of Technology, Brno University of Technology, Purkynova 123, 612 00 Brno, Czech Republic
| | - Vojtech Adam
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00 Brno, Czech Republic.,Central European Institute of Technology, Brno University of Technology, Purkynova 123, 612 00 Brno, Czech Republic
| | - Artur Krężel
- Department of Chemical Biology, Faculty of Biotechnology, University of Wrocław, F. Joliot-Curie 14a, 50-383 Wrocław, Poland
| |
Collapse
|
18
|
Wu Z, Roberts DS, Melby JA, Wenger K, Wetzel M, Gu Y, Ramanathan SG, Bayne EF, Liu X, Sun R, Ong IM, McIlwain SJ, Ge Y. MASH Explorer: A Universal Software Environment for Top-Down Proteomics. J Proteome Res 2020; 19:3867-3876. [PMID: 32786689 DOI: 10.1021/acs.jproteome.0c00469] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Top-down mass spectrometry (MS)-based proteomics enable a comprehensive analysis of proteoforms with molecular specificity to achieve a proteome-wide understanding of protein functions. However, the lack of a universal software for top-down proteomics is becoming increasingly recognized as a major barrier, especially for newcomers. Here, we have developed MASH Explorer, a universal, comprehensive, and user-friendly software environment for top-down proteomics. MASH Explorer integrates multiple spectral deconvolution and database search algorithms into a single, universal platform which can process top-down proteomics data from various vendor formats, for the first time. It addresses the urgent need in the rapidly growing top-down proteomics community and is freely available to all users worldwide. With the critical need and tremendous support from the community, we envision that this MASH Explorer software package will play an integral role in advancing top-down proteomics to realize its full potential for biomedical research.
Collapse
Affiliation(s)
- Zhijie Wu
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - David S Roberts
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Jake A Melby
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Kent Wenger
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Molly Wetzel
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Yiwen Gu
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | | | - Elizabeth F Bayne
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Xiaowen Liu
- Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States.,Center for Computational Biology and Bioinformatics, Indiana University-Purdue University Indianapolis, Indianapolis, Indiana 46202, United States
| | - Ruixiang Sun
- National Institute of Biological Sciences, Beijing, 102206, China
| | - Irene M Ong
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,University of Wisconsin Carbone Cancer Center, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Department of Obstetrics and Gynecology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Sean J McIlwain
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,University of Wisconsin Carbone Cancer Center, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| | - Ying Ge
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States.,Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, United States
| |
Collapse
|