1
|
Kuhnen G, Class LC, Badekow S, Hanisch KL, Rohn S, Kuballa J. Python workflow for the selection and identification of marker peptides-proof-of-principle study with heated milk. Anal Bioanal Chem 2024; 416:3349-3360. [PMID: 38607384 PMCID: PMC11106092 DOI: 10.1007/s00216-024-05286-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 03/26/2024] [Accepted: 04/02/2024] [Indexed: 04/13/2024]
Abstract
The analysis of almost holistic food profiles has developed considerably over the last years. This has also led to larger amounts of data and the ability to obtain more information about health-beneficial and adverse constituents in food than ever before. Especially in the field of proteomics, software is used for evaluation, and these do not provide specific approaches for unique monitoring questions. An additional and more comprehensive way of evaluation can be done with the programming language Python. It offers broad possibilities by a large ecosystem for mass spectrometric data analysis, but needs to be tailored for specific sets of features, the research questions behind. It also offers the applicability of various machine-learning approaches. The aim of the present study was to develop an algorithm for selecting and identifying potential marker peptides from mass spectrometric data. The workflow is divided into three steps: (I) feature engineering, (II) chemometric data analysis, and (III) feature identification. The first step is the transformation of the mass spectrometric data into a structure, which enables the application of existing data analysis packages in Python. The second step is the data analysis for selecting single features. These features are further processed in the third step, which is the feature identification. The data used exemplarily in this proof-of-principle approach was from a study on the influence of a heat treatment on the milk proteome/peptidome.
Collapse
Affiliation(s)
- Gesine Kuhnen
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
- Department of Food Chemistry and Analysis, Institute of Food Technology and Food Chemistry, Technical University Berlin, Gustav Meyer Allee 25, 13355, Berlin, Germany
| | - Lisa-Carina Class
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
- Hamburg School of Food Science, Institute of Food Chemistry, University of Hamburg, Grindelallee 117, 20146, Hamburg, Germany
| | - Svenja Badekow
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
| | - Kim Lara Hanisch
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany
| | - Sascha Rohn
- Department of Food Chemistry and Analysis, Institute of Food Technology and Food Chemistry, Technical University Berlin, Gustav Meyer Allee 25, 13355, Berlin, Germany
| | - Jürgen Kuballa
- GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany.
| |
Collapse
|
2
|
Böttcher B, Kienast SD, Leufken J, Eggers C, Sharma P, Leufken CM, Morgner B, Drexler HCA, Schulz D, Allert S, Jacobsen ID, Vylkova S, Leidel SA, Brunke S. A highly conserved tRNA modification contributes to C. albicans filamentation and virulence. Microbiol Spectr 2024; 12:e0425522. [PMID: 38587411 PMCID: PMC11064501 DOI: 10.1128/spectrum.04255-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 01/18/2024] [Indexed: 04/09/2024] Open
Abstract
tRNA modifications play important roles in maintaining translation accuracy in all domains of life. Disruptions in the tRNA modification machinery, especially of the anticodon stem loop, can be lethal for many bacteria and lead to a broad range of phenotypes in baker's yeast. Very little is known about the function of tRNA modifications in host-pathogen interactions, where rapidly changing environments and stresses require fast adaptations. We found that two closely related fungal pathogens of humans, the highly pathogenic Candida albicans and its much less pathogenic sister species, Candida dubliniensis, differ in the function of a tRNA-modifying enzyme. This enzyme, Hma1, exhibits species-specific effects on the ability of the two fungi to grow in the hypha morphology, which is central to their virulence potential. We show that Hma1 has tRNA-threonylcarbamoyladenosine dehydratase activity, and its deletion alters ribosome occupancy, especially at 37°C-the body temperature of the human host. A C. albicans HMA1 deletion mutant also shows defects in adhesion to and invasion into human epithelial cells and shows reduced virulence in a fungal infection model. This links tRNA modifications to host-induced filamentation and virulence of one of the most important fungal pathogens of humans.IMPORTANCEFungal infections are on the rise worldwide, and their global burden on human life and health is frequently underestimated. Among them, the human commensal and opportunistic pathogen, Candida albicans, is one of the major causative agents of severe infections. Its virulence is closely linked to its ability to change morphologies from yeasts to hyphae. Here, this ability is linked-to our knowledge for the first time-to modifications of tRNA and translational efficiency. One tRNA-modifying enzyme, Hma1, plays a specific role in C. albicans and its ability to invade the host. This adds a so-far unknown layer of regulation to the fungal virulence program and offers new potential therapeutic targets to fight fungal infections.
Collapse
Affiliation(s)
- Bettina Böttcher
- Department of Microbial Pathogenicity Mechanisms, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
- Septomics Research Center, Friedrich Schiller University and Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
| | - Sandra D. Kienast
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany
- Research Group for Cellular RNA Biochemistry, Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Bern, Switzerland
| | - Johannes Leufken
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany
- Research Group for Cellular RNA Biochemistry, Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Bern, Switzerland
| | - Cristian Eggers
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany
- Research Group for Cellular RNA Biochemistry, Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Bern, Switzerland
| | - Puneet Sharma
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany
- Research Group for Cellular RNA Biochemistry, Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Bern, Switzerland
| | - Christine M. Leufken
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany
| | - Bianka Morgner
- Department of Microbial Pathogenicity Mechanisms, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
| | - Hannes C. A. Drexler
- Bioanalytical Mass Spectrometry Unit, Max Planck Institute for Molecular Biomedicine, Münster, Germany
| | - Daniela Schulz
- Department of Microbial Pathogenicity Mechanisms, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
| | - Stefanie Allert
- Department of Microbial Pathogenicity Mechanisms, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
| | - Ilse D. Jacobsen
- Research Group Microbial Immunology, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
- Institute of Microbiology, Friedrich Schiller University, Jena, Germany
| | - Slavena Vylkova
- Septomics Research Center, Friedrich Schiller University and Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
| | - Sebastian A. Leidel
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Münster, Germany
- Research Group for Cellular RNA Biochemistry, Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Bern, Switzerland
| | - Sascha Brunke
- Department of Microbial Pathogenicity Mechanisms, Leibniz Institute for Natural Product Research and Infection Biology – Hans Knoell Institute, Jena, Germany
| |
Collapse
|
3
|
Picciani M, Gabriel W, Giurcoiu VG, Shouman O, Hamood F, Lautenbacher L, Jensen CB, Müller J, Kalhor M, Soleymaniniya A, Kuster B, The M, Wilhelm M. Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit. Proteomics 2024; 24:e2300112. [PMID: 37672792 DOI: 10.1002/pmic.202300112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/17/2023] [Accepted: 08/18/2023] [Indexed: 09/08/2023]
Abstract
Machine learning (ML) and deep learning (DL) models for peptide property prediction such as Prosit have enabled the creation of high quality in silico reference libraries. These libraries are used in various applications, ranging from data-independent acquisition (DIA) data analysis to data-driven rescoring of search engine results. Here, we present Oktoberfest, an open source Python package of our spectral library generation and rescoring pipeline originally only available online via ProteomicsDB. Oktoberfest is largely search engine agnostic and provides access to online peptide property predictions, promoting the adoption of state-of-the-art ML/DL models in proteomics analysis pipelines. We demonstrate its ability to reproduce and even improve our results from previously published rescoring analyses on two distinct use cases. Oktoberfest is freely available on GitHub (https://github.com/wilhelm-lab/oktoberfest) and can easily be installed locally through the cross-platform PyPI Python package.
Collapse
Affiliation(s)
- Mario Picciani
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Wassim Gabriel
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Victor-George Giurcoiu
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Omar Shouman
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Firas Hamood
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Ludwig Lautenbacher
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Cecilia Bang Jensen
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Julian Müller
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mostafa Kalhor
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Armin Soleymaniniya
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Matthew The
- Chair of Proteomics and Bioanalytics, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mathias Wilhelm
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| |
Collapse
|
4
|
Schiller H, Hong Y, Kouassi J, Rados T, Kwak J, DiLucido A, Safer D, Marchfelder A, Pfeiffer F, Bisson A, Schulze S, Pohlschroder M. Identification of structural and regulatory cell-shape determinants in Haloferax volcanii. Nat Commun 2024; 15:1414. [PMID: 38360755 PMCID: PMC10869688 DOI: 10.1038/s41467-024-45196-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 01/16/2024] [Indexed: 02/17/2024] Open
Abstract
Archaea play indispensable roles in global biogeochemical cycles, yet many crucial cellular processes, including cell-shape determination, are poorly understood. Haloferax volcanii, a model haloarchaeon, forms rods and disks, depending on growth conditions. Here, we used a combination of iterative proteomics, genetics, and live-cell imaging to identify mutants that only form rods or disks. We compared the proteomes of the mutants with wild-type cells across growth phases, thereby distinguishing between protein abundance changes specific to cell shape and those related to growth phases. The results identified a diverse set of proteins, including predicted transporters, transducers, signaling components, and transcriptional regulators, as important for cell-shape determination. Through phenotypic characterization of deletion strains, we established that rod-determining factor A (RdfA) and disk-determining factor A (DdfA) are required for the formation of rods and disks, respectively. We also identified structural proteins, including an actin homolog that plays a role in disk-shape morphogenesis, which we named volactin. Using live-cell imaging, we determined volactin's cellular localization and showed its dynamic polymerization and depolymerization. Our results provide insights into archaeal cell-shape determination, with possible implications for understanding the evolution of cell morphology regulation across domains.
Collapse
Affiliation(s)
- Heather Schiller
- University of Pennsylvania, Department of Biology, Philadelphia, PA, 19104, USA
| | - Yirui Hong
- University of Pennsylvania, Department of Biology, Philadelphia, PA, 19104, USA
| | - Joshua Kouassi
- University of Pennsylvania, Department of Biology, Philadelphia, PA, 19104, USA
| | - Theopi Rados
- Brandeis University, Department of Biology, Waltham, MA, 02453, USA
| | - Jasmin Kwak
- Brandeis University, Department of Biology, Waltham, MA, 02453, USA
| | - Anthony DiLucido
- University of Pennsylvania, Department of Biology, Philadelphia, PA, 19104, USA
| | - Daniel Safer
- University of Pennsylvania, Department of Physiology, Philadelphia, PA, 19104, USA
| | | | - Friedhelm Pfeiffer
- Biology II, Ulm University, 89069, Ulm, Germany
- Computational Biology Group, Max Planck Institute of Biochemistry, 82152, Martinsried, Germany
| | - Alexandre Bisson
- Brandeis University, Department of Biology, Waltham, MA, 02453, USA.
| | - Stefan Schulze
- University of Pennsylvania, Department of Biology, Philadelphia, PA, 19104, USA.
- Rochester Institute of Technology, Thomas H. Gosnell School of Life Sciences, Rochester, NY, 14623, USA.
| | | |
Collapse
|
5
|
Hellmann MJ, Moerschbacher BM, Cord-Landwehr S. Fast insights into chitosan-cleaving enzymes by simultaneous analysis of polymers and oligomers through size exclusion chromatography. Sci Rep 2024; 14:3417. [PMID: 38341520 PMCID: PMC10858908 DOI: 10.1038/s41598-024-54002-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 02/07/2024] [Indexed: 02/12/2024] Open
Abstract
The thorough characterization of chitosan-cleaving enzymes is crucial to unveil structure-function relationships of this promising class of biomolecules for both, enzymatic fingerprinting analyses and to use the enzymes as biotechnological tools to produce tailor-made chitosans for diverse applications. Analyzing polymeric substrates as well as oligomeric products has been established as an effective way to understand the actions of enzymes, but it currently requires separate, rather laborious methods to obtain the full picture. Here, we present ultra high performance size exclusion chromatography coupled to refractive index and mass spectrometry detection (UHPSEC-RI-MS) as a straightforward method for the semi-quantitative analysis of chitosan oligomers of up to ten monomers in length. Additionally, the method allows to determine the average molecular weight of the remaining polymers and its distribution. By sampling live from an ongoing enzymatic reaction, UHPSEC-RI-MS offers the unique opportunity to analyze polymers and oligomers simultaneously-i.e., to monitor the molecular weight reduction of the polymeric substrate over the course of the digestion, while at the same time analyzing the emerging oligomeric products in a semi-quantitative manner. In this way, a single simple analysis yields detailed insights into an enzyme's action on a given substrate.
Collapse
Affiliation(s)
- Margareta J Hellmann
- Institute for Biology and Biotechnology of Plants, University of Münster, 48143, Münster, Germany
| | - Bruno M Moerschbacher
- Institute for Biology and Biotechnology of Plants, University of Münster, 48143, Münster, Germany.
| | - Stefan Cord-Landwehr
- Institute for Biology and Biotechnology of Plants, University of Münster, 48143, Münster, Germany
| |
Collapse
|
6
|
Chen X, Wu W, Sun H, Chen L, Wang Y, Xia B, Zhou Y. Development and Application of a Comprehensive Nontargeted Screening Strategy for Aristolochic Acid Analogues. Anal Chem 2024; 96:1922-1931. [PMID: 38264982 DOI: 10.1021/acs.analchem.3c04064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]
Abstract
Aristolochic acid analogs (AAAs) are naturally occurring carcinogenic and toxic compounds that pose a safety threat to pharmaceuticals and the environment. It is challenging to screen AAAs due to their lack of characteristic mass spectral fragmentation and their presence of structural diversity. A comprehensive nontargeted screening strategy was proposed by taking into account diverse factors and incorporating various self-developed techniques, and a Python3-based toolkit called AAAs_finder was developed for its implementation. The main procedures consist of virtual structure and ultraviolet and visible (UV) spectra database creation, exact mass and UV spectra-based suspect data extraction, tandem mass spectra (MS/MS) anthropomorphic interpretation, and multicondition retention time (RT) prediction-based candidate structures ranking. To initially assess screening feasibility, eight hypothetical unknown samples were subjected to nontargeted screening using the AAAs_finder toolkit and two other advanced tools. The results showed that the former successfully identified all, while the latter two only managed to identify two and three, respectively, indicating that our strategy was more feasible. After that, the strategy was carefully evaluated for false positives and false negatives, instrument dependence, reproducibility, and sensitivity. After the evaluation, the strategy was successfully applied to the screening of AAAs in real samples, such as herbal medicine, spiked soil, and water. Overall, this study proposed a nontargeted screening strategy and toolkit independent of characteristic mass spectral fragmentation and able to overcome challenges posed by structural diversity for the AAAs screening, which is also valuable for other classes of compounds.
Collapse
Affiliation(s)
- Xiaoqi Chen
- Chengdu Institute of Organic Chemistry, Chinese Academy of Sciences, Chengdu 610041, China
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Wenlin Wu
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Chengdu Institute of Food Inspection, Chengdu 611130, China
- Key Laboratory of Chemical Metrology and Applications on Nutrition and Health for State Market Regulation, Beijing 100029, China
| | - Hongbing Sun
- Chengdu Institute of Organic Chemistry, Chinese Academy of Sciences, Chengdu 610041, China
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Sichuan Academy of Chinese Medicine Sciences, Chengdu 610041, China
| | - Lu Chen
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yu Wang
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bing Xia
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
| | - Yan Zhou
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu 610041, China
| |
Collapse
|
7
|
Bonin M, Irion AL, Jürß A, Pascual S, Cord-Landwehr S, Planas A, Moerschbacher BM. Engineering of a chitin deacetylase to generate tailor-made chitosan polymers. PLoS Biol 2024; 22:e3002459. [PMID: 38236907 PMCID: PMC10796014 DOI: 10.1371/journal.pbio.3002459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 12/04/2023] [Indexed: 01/22/2024] Open
Abstract
Chitin deacetylases (CDAs) emerge as a valuable tool to produce chitosans with a nonrandom distribution of N-acetylglucosamine (GlcNAc) and glucosamine (GlcN) units. We hypothesized before that CDAs tend to bind certain sequences within the substrate matching their subsite preferences for either GlcNAc or GlcN units. Thus, they deacetylate or N-acetylate their substrates at nonrandom positions. To understand the molecular basis of these preferences, we analyzed the binding site of a CDA from Pestalotiopsis sp. (PesCDA) using a detailed activity screening of a site-saturation mutagenesis library. In addition, molecular dynamics simulations were conducted to get an in-depth view of crucial interactions along the binding site. Besides elucidating the function of several amino acids, we were able to show that only 3 residues are responsible for the highly specific binding of PesCDA to oligomeric substrates. The preference to bind a GlcNAc unit at subsite -2 and -1 can mainly be attributed to N75 and H199, respectively. Whereas an exchange of N75 at subsite -2 eliminates enzyme activity, H199 can be substituted with tyrosine to increase the GlcN acceptance at subsite -1. This change in substrate preference not only increases enzyme activity on certain substrates and changes composition of oligomeric products but also significantly changes the pattern of acetylation (PA) when N-acetylating polyglucosamine. Consequently, we could clearly show how subsite preferences influence the PA of chitosans produced with CDAs.
Collapse
Affiliation(s)
- Martin Bonin
- Institute for Biology and Biotechnology of Plants, University of Münster, Münster, Germany
- Laboratory of Biochemistry, Institut Químic de Sarrià, University Ramon Llull, Barcelona, Spain
| | - Antonia L. Irion
- Institute for Biology and Biotechnology of Plants, University of Münster, Münster, Germany
| | - Anika Jürß
- Institute for Biology and Biotechnology of Plants, University of Münster, Münster, Germany
| | - Sergi Pascual
- Laboratory of Biochemistry, Institut Químic de Sarrià, University Ramon Llull, Barcelona, Spain
| | - Stefan Cord-Landwehr
- Institute for Biology and Biotechnology of Plants, University of Münster, Münster, Germany
| | - Antoni Planas
- Laboratory of Biochemistry, Institut Químic de Sarrià, University Ramon Llull, Barcelona, Spain
| | - Bruno M. Moerschbacher
- Institute for Biology and Biotechnology of Plants, University of Münster, Münster, Germany
| |
Collapse
|
8
|
Qian Y, Guo X, Wang Y, Ouyang Z, Ma X. Mobility-Modulated Sequential Dissociation Analysis Enables Structural Lipidomics in Mass Spectrometry Imaging. Angew Chem Int Ed Engl 2023; 62:e202312275. [PMID: 37946693 DOI: 10.1002/anie.202312275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 10/09/2023] [Accepted: 11/09/2023] [Indexed: 11/12/2023]
Abstract
Spatial lipidomics based on mass spectrometry imaging (MSI) is a powerful tool for fundamental biology studies and biomarker discovery. But the structure-resolving capability of MSI is limited because of the lack of multiplexed tandem mass spectrometry (MS/MS) method, primarily due to the small sample amount available from each pixel and the poor ion usage in MS/MS analysis. Here, we report a mobility-modulated sequential dissociation (MMSD) strategy for multiplex MS/MS imaging of distinct lipids from biological tissues. With ion mobility-enabled data-independent acquisition and automated spectrum deconvolution, MS/MS spectra of a large number of lipid species from each tissue pixel are acquired, at no expense of imaging speed. MMSD imaging is highlighted by MS/MS imaging of 24 structurally distinct lipids in the mouse brain and the revealing of the correlation of a structurally distinct phosphatidylethanolamine isomer (PE 18 : 1_18 : 1) from a human hepatocellular carcinoma (HCC) tissue. Mapping of structurally distinct lipid isomers is now enabled and spatial lipidomics becomes feasible for MSI.
Collapse
Affiliation(s)
- Yao Qian
- State Key Laboratory of Precision Measurement Technology and Instruments, Department of Precision Instrument, Tsinghua University, Beijing, 100084, China
| | - Xiangyu Guo
- State Key Laboratory of Precision Measurement Technology and Instruments, Department of Precision Instrument, Tsinghua University, Beijing, 100084, China
| | - Yunfang Wang
- Hepato-pancreato-biliary Center, Beijing Tsinghua Changgung Hospital, Tsinghua University, Beijing, 102218, China
| | - Zheng Ouyang
- State Key Laboratory of Precision Measurement Technology and Instruments, Department of Precision Instrument, Tsinghua University, Beijing, 100084, China
| | - Xiaoxiao Ma
- State Key Laboratory of Precision Measurement Technology and Instruments, Department of Precision Instrument, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
9
|
Morosi L, Miotto M, Timo S, Carloni S, Bruno E, Meroni M, Menna E, Lodato S, Rescigno M, Martano G. MSIpixel: a fully automated pipeline for compound annotation and quantitation in mass spectrometry imaging experiments. Brief Bioinform 2023; 25:bbad463. [PMID: 38102070 PMCID: PMC10753532 DOI: 10.1093/bib/bbad463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 10/13/2023] [Accepted: 11/22/2023] [Indexed: 12/17/2023] Open
Abstract
Mass spectrometry imaging (MSI) is commonly used to map the spatial distribution of small molecules within complex biological matrices. One of the major challenges in imaging MS-based spatial metabolomics is molecular identification and metabolite annotation, to address this limitation, annotation is often complemented with parallel bulk LC-MS2-based metabolomics to confirm and validate identifications. Here we applied MSI method, utilizing data-dependent acquisition, to visualize and identify unknown molecules in a single instrument run. To reach this aim we developed MSIpixel, a fully automated pipeline for compound annotation and quantitation in MSI experiments. It overcomes challenges in molecular identification, and improving reliability and comprehensiveness in MSI-based spatial metabolomics.
Collapse
Affiliation(s)
- Lavinia Morosi
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
| | - Matteo Miotto
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
| | - Sara Timo
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
| | - Sara Carloni
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
- Department of Biomedical Sciences, Humanitas University, Via R. Levi Montalcini 4, Pieve Emanuele (MI), Italy
| | - Eleonora Bruno
- Department of Experimental Oncology, Fondazione IRCCS Istituto Nazionale dei tumori di Milano. Via Venezian 1, Milan, Italy
| | - Marina Meroni
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Department of Oncology, via Mario Negri 2, Milan, Italy
| | - Elisabetta Menna
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
- Institute of Neuroscience, National Research Council of Italy (CNR) c/o Humanitas Mirasole S.p.A, Via Manzoni 56, Rozzano (MI), Italy
| | - Simona Lodato
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
- Department of Biomedical Sciences, Humanitas University, Via R. Levi Montalcini 4, Pieve Emanuele (MI), Italy
| | - Maria Rescigno
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
- Department of Biomedical Sciences, Humanitas University, Via R. Levi Montalcini 4, Pieve Emanuele (MI), Italy
| | - Giuseppe Martano
- IRCCS Humanitas Research Hospital, Via Manzoni 56, Rozzano (MI), Italy
- Institute of Neuroscience, National Research Council of Italy (CNR) c/o Humanitas Mirasole S.p.A, Via Manzoni 56, Rozzano (MI), Italy
| |
Collapse
|
10
|
Ross D, Bilbao A, Lee JY, Zheng X. mzapy: An Open-Source Python Library Enabling Efficient Extraction and Processing of Ion Mobility Spectrometry-Mass Spectrometry Data in the MZA File Format. Anal Chem 2023. [PMID: 37307589 DOI: 10.1021/acs.analchem.3c01653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Analysis of ion mobility spectrometry (IMS) data has been challenging and limited the full utility of these measurements. Unlike liquid chromatography-mass spectrometry, where a plethora of tools with well-established algorithms exist, the incorporation of the additional IMS dimension requires upgrading existing computational pipelines and developing new algorithms to fully exploit the advantages of the technology. We have recently reported MZA, a new and simple mass spectrometry data structure based on the broadly supported HDF5 format and created to facilitate software development. While this format is inherently supportive of application development, the availability of core libraries in popular programming languages with standard mass spectrometry utilities will facilitate fast software development and broader adoption of the format. To this end, we present a Python package, mzapy, for efficient extraction and processing of mass spectrometry data in the MZA format, especially for complex data containing ion mobility spectrometry dimension. In addition to raw data extraction, mzapy contains supporting utilities enabling tasks including calibration, signal processing, peak finding, and generating plots. Being implemented in pure Python and having minimal and largely standardized dependencies makes mzapy uniquely suited to application development in the multiomics domain. The mzapy package is free and open-source, includes comprehensive documentation, and is structured to support future extension to meet the evolving needs of the MS community. The software source code is freely available at https://github.com/PNNL-m-q/mzapy.
Collapse
Affiliation(s)
- Dylan Ross
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Aivett Bilbao
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Joon-Yong Lee
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Xueyun Zheng
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| |
Collapse
|
11
|
Wisecaver JH, Auber RP, Pendleton AL, Watervoort NF, Fallon TR, Riedling OL, Manning SR, Moore BS, Driscoll WW. Extreme genome diversity and cryptic speciation in a harmful algal-bloom-forming eukaryote. Curr Biol 2023:S0960-9822(23)00597-3. [PMID: 37224809 DOI: 10.1016/j.cub.2023.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 03/14/2023] [Accepted: 05/02/2023] [Indexed: 05/26/2023]
Abstract
Harmful algal blooms of the toxic haptophyte Prymnesium parvum are a recurrent problem in many inland and estuarine waters around the world. Strains of P. parvum vary in the toxins they produce and in other physiological traits associated with harmful algal blooms, but the genetic basis for this variation is unknown. To investigate genome diversity in this morphospecies, we generated genome assemblies for 15 phylogenetically and geographically diverse strains of P. parvum, including Hi-C guided, near-chromosome-level assemblies for two strains. Comparative analysis revealed considerable DNA content variation between strains, ranging from 115 to 845 Mbp. Strains included haploids, diploids, and polyploids, but not all differences in DNA content were due to variation in genome copy number. Haploid genome size between strains of different chemotypes differed by as much as 243 Mbp. Syntenic and phylogenetic analyses indicate that UTEX 2797, a common laboratory strain from Texas, is a hybrid that retains two phylogenetically distinct haplotypes. Investigation of gene families variably present across the strains identified several functional categories associated with metabolic and genome size variation in P. parvum, including genes for the biosynthesis of toxic metabolites and proliferation of transposable elements. Together, our results indicate that P. parvum comprises multiple cryptic species. These genomes provide a robust phylogenetic and genomic framework for investigations into the eco-physiological consequences of the intra- and inter-specific genetic variation present in P. parvum and demonstrate the need for similar resources for other harmful algal-bloom-forming morphospecies.
Collapse
Affiliation(s)
- Jennifer H Wisecaver
- Department of Biochemistry, Purdue University, 175 S University St, West Lafayette, IN 47907, USA; Purdue Center for Plant Biology, Purdue University, 175 S University St, West Lafayette, IN 47907, USA.
| | - Robert P Auber
- Department of Biochemistry, Purdue University, 175 S University St, West Lafayette, IN 47907, USA; Purdue Center for Plant Biology, Purdue University, 175 S University St, West Lafayette, IN 47907, USA
| | - Amanda L Pendleton
- Department of Biochemistry, Purdue University, 175 S University St, West Lafayette, IN 47907, USA; Purdue Center for Plant Biology, Purdue University, 175 S University St, West Lafayette, IN 47907, USA
| | - Nathan F Watervoort
- Department of Biochemistry, Purdue University, 175 S University St, West Lafayette, IN 47907, USA; Purdue Center for Plant Biology, Purdue University, 175 S University St, West Lafayette, IN 47907, USA
| | - Timothy R Fallon
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography and University of California San Diego, 9500 Gilman Dr #0204, La Jolla, CA 92093, USA
| | - Olivia L Riedling
- Department of Biochemistry, Purdue University, 175 S University St, West Lafayette, IN 47907, USA; Purdue Center for Plant Biology, Purdue University, 175 S University St, West Lafayette, IN 47907, USA
| | - Schonna R Manning
- Department of Biological Sciences, Institute of Environment, Florida International University, 3000 NE 151st Street, MSB 250B, North Miami, FL 33181, USA
| | - Bradley S Moore
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography and University of California San Diego, 9500 Gilman Dr #0204, La Jolla, CA 92093, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California at San Diego, 9500 Gilman Dr #0204, La Jolla, CA 92093, USA
| | - William W Driscoll
- Department of Biology, Penn State Harrisburg, 777 W. Harrisburg Pike, Middletown, PA 17057, USA
| |
Collapse
|
12
|
Zhou S, Fatma Z, Xue P, Mishra S, Cao M, Zhao H, Sweedler JV. Mass Spectrometry-Based High-Throughput Quantification of Bioproducts in Liquid Culture. Anal Chem 2023; 95:4067-4076. [PMID: 36790390 DOI: 10.1021/acs.analchem.2c04845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
To meet the ever-increasing need for high-throughput screening in metabolic engineering, information-rich, fast screening methods are needed. Mass spectrometry (MS) provides an efficient and general approach for metabolite screening and offers the capability of characterizing a broad range of analytes in a label-free manner, but often requires a range of sample clean-up and extraction steps. Liquid extraction surface analysis (LESA) coupled MS is an image-guided MS surface analysis approach that directly samples and introduces metabolites from a surface to MS. Here, we combined the advantages of LESA-MS and an acoustic liquid handler with stable isotope-labeled internal standards. This approach provides absolute quantitation of target chemicals from liquid culture-dried droplets and enables high-throughput quantitative screening for microbial metabolites. In this study, LESA-MS was successfully applied to quantify several different metabolites (itaconic acid, triacetic acid lactone, and palmitic acid) from different yeast strains in different mediums, demonstrating its versatility, accuracy, and efficiency across a range of microbial engineering applications.
Collapse
Affiliation(s)
- Shuaizhen Zhou
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Zia Fatma
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Pu Xue
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Shekhar Mishra
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Mingfeng Cao
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Huimin Zhao
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Jonathan V Sweedler
- Department of Energy Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
13
|
Hermann M, Metwally H, Yu J, Smith R, Tomm H, Kaufmann M, Ren KYM, Liu C, LeBlanc Y, Covey TR, Ross AC, Oleschuk RD. 3D printer platform and conductance feedback loop for automated imaging of uneven surfaces by liquid microjunction-surface sampling probe mass spectrometry. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2023:e9492. [PMID: 36756683 DOI: 10.1002/rcm.9492] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/06/2023] [Accepted: 02/07/2023] [Indexed: 06/18/2023]
Abstract
RATIONALE Molecular imaging of samples using mass spectrometric techniques, such as matrix-assisted laser desorption ionization or desorption electrospray ionization, requires the sample surface to be even/flat and sliced into thin sections (c. 10 μm). Furthermore, sample preparation steps can alter the analyte composition of the sample. The liquid microjunction-surface sampling probe (LMJ-SSP) is a robust sampling interface that enables surface profiling with minimal sample preparation. In conjunction with a conductance feedback system, the LMJ-SSP can be used to automatically sample uneven specimens. METHODS A sampling stage was built with a modified 3D printer where the LMJ-SSP is attached to the printing head. This setup can scan across flat and even surfaces in a predefined pattern ("static sampling mode"). Uneven samples are automatically probed in "conductance sampling mode" where an electric potential is applied and measured at the probe. When the probe contacts the electrically grounded sample, the potential at the probe drops, which is used as a feedback signal to determine the optimal position of the probe for sampling each location. RESULTS The applicability of the probe/sensing system was demonstrated by first examining the strawberry tissue using the "static sampling mode." Second, porcine tissue samples were profiled using the "conductance sampling mode." With minimal sample preparation, an area of 11 × 15 mm was profiled in less than 2 h. From the obtained results, adipose areas could be distinguished from non-adipose parts. The versatility of the approach was further demonstrated by directly sampling the bacteria colonies on agar and resected human kidney (intratumoral hemorrhage) specimens with thicknesses ranging from 1 to 4 mm. CONCLUSION The LMJ-SSP in conjunction with a conductive feedback system is a powerful tool that allows for fast, reproducible, and automated assessment of uneven surfaces with minimal sample preparation. This setup could be used for perioperative assessment of tissue samples, food screening, and natural product discovery, among others.
Collapse
Affiliation(s)
- Matthias Hermann
- Department of Chemistry, Queen's University, Kingston, Ontario, Canada
| | - Haidy Metwally
- Department of Chemistry, Queen's University, Kingston, Ontario, Canada
| | - Jian Yu
- Department of Chemistry, Queen's University, Kingston, Ontario, Canada
| | - Rachael Smith
- Department of Chemistry, Queen's University, Kingston, Ontario, Canada
| | - Hailey Tomm
- Department of Chemistry, Queen's University, Kingston, Ontario, Canada
| | - Martin Kaufmann
- Department of Surgery, Queen's University, Kingston, Ontario, Canada
| | - Kevin Y M Ren
- Department of Pathology, Queen's University, Kingston, Ontario, Canada
| | | | | | | | - Avena C Ross
- Department of Chemistry, Queen's University, Kingston, Ontario, Canada
| | | |
Collapse
|
14
|
Bittremieux W, Levitsky L, Pilz M, Sachsenberg T, Huber F, Wang M, Dorrestein PC. Unified and Standardized Mass Spectrometry Data Processing in Python Using spectrum_utils. J Proteome Res 2023; 22:625-631. [PMID: 36688502 DOI: 10.1021/acs.jproteome.2c00632] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
spectrum_utils is a Python package for mass spectrometry data processing and visualization. Since its introduction, spectrum_utils has grown into a fundamental software solution that powers various applications in proteomics and metabolomics, ranging from spectrum preprocessing prior to spectrum identification and machine learning applications to spectrum plotting from online data repositories and assisting data analysis tasks for dozens of other projects. Here, we present updates to spectrum_utils, which include new functionality to integrate mass spectrometry community data standards, enhanced mass spectral data processing, and unified mass spectral data visualization in Python. spectrum_utils is freely available as open source at https://github.com/bittremieux/spectrum_utils.
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium.,Biomedical Informatics Network Antwerpen (biomina), 2020 Antwerp, Belgium
| | - Lev Levitsky
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Matteo Pilz
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
| | - Timo Sachsenberg
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
| | - Florian Huber
- Centre for Digitalisation and Digitality, University of Applied Sciences Düsseldorf, 40476 Düsseldorf, Germany
| | - Mingxun Wang
- Department of Computer Science, University of California─Riverside, Riverside, California 92507, United States
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, University of California─San Diego, La Jolla, California 92093, United States.,Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California─San Diego, La Jolla California 92093, United States
| |
Collapse
|
15
|
Bilbao A, Ross DH, Lee JY, Donor MT, Williams SM, Zhu Y, Ibrahim YM, Smith RD, Zheng X. MZA: A Data Conversion Tool to Facilitate Software Development and Artificial Intelligence Research in Multidimensional Mass Spectrometry. J Proteome Res 2023; 22:508-513. [PMID: 36414245 PMCID: PMC9898216 DOI: 10.1021/acs.jproteome.2c00313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Modern mass spectrometry-based workflows employing hybrid instrumentation and orthogonal separations collect multidimensional data, potentially allowing deeper understanding in omics studies through adoption of artificial intelligence methods. However, the large volume of these rich spectra challenges existing data storage and access technologies, therefore precluding informatics advancements. We present MZA (pronounced m-za), the mass-to-charge (m/z) generic data storage and access tool designed to facilitate software development and artificial intelligence research in multidimensional mass spectrometry measurements. Composed of a data conversion tool and a simple file structure based on the HDF5 format, MZA provides easy, cross-platform and cross-programming language access to raw MS-data, enabling fast development of new tools in data science programming languages such as Python and R. The software executable, example MS-data and example Python and R scripts are freely available at https://github.com/PNNL-m-q/mza.
Collapse
Affiliation(s)
- Aivett Bilbao
- Pacific Northwest National Laboratory, Richland, WA, 99352, USA,Corresponding authors Aivett Bilbao – Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, 99352, United States; .; Xueyun Zheng – Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, United States;
| | - Dylan H. Ross
- Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Joon-Yong Lee
- Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Micah T. Donor
- Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | | | - Ying Zhu
- Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | | | | | - Xueyun Zheng
- Pacific Northwest National Laboratory, Richland, WA, 99352, USA,Corresponding authors Aivett Bilbao – Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, 99352, United States; .; Xueyun Zheng – Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, United States;
| |
Collapse
|
16
|
Han Y, Wennersten SA, Wright JM, Ludwig RW, Lau E, Lam MPY. Proteogenomics reveals sex-biased aging genes and coordinated splicing in cardiac aging. Am J Physiol Heart Circ Physiol 2022; 323:H538-H558. [PMID: 35930447 PMCID: PMC9448281 DOI: 10.1152/ajpheart.00244.2022] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 07/20/2022] [Accepted: 07/31/2022] [Indexed: 01/24/2023]
Abstract
The risks of heart diseases are significantly modulated by age and sex, but how these factors influence baseline cardiac gene expression remains incompletely understood. Here, we used RNA sequencing and mass spectrometry to compare gene expression in female and male young adult (4 mo) and early aging (20 mo) mouse hearts, identifying thousands of age- and sex-dependent gene expression signatures. Sexually dimorphic cardiac genes are broadly distributed, functioning in mitochondrial metabolism, translation, and other processes. In parallel, we found over 800 genes with differential aging response between male and female, including genes in cAMP and PKA signaling. Analysis of the sex-adjusted aging cardiac transcriptome revealed a widespread remodeling of exon usage patterns that is largely independent from differential gene expression, concomitant with upstream changes in RNA-binding protein and splice factor transcripts. To evaluate the impact of the splicing events on cardiac proteoform composition, we applied an RNA-guided proteomics computational pipeline to analyze the mass spectrometry data and detected hundreds of putative splice variant proteins that have the potential to rewire the cardiac proteome. Taken together, the results here suggest that cardiac aging is associated with 1) widespread sex-biased aging genes and 2) a rewiring of RNA splicing programs, including sex- and age-dependent changes in exon usages and splice patterns that have the potential to influence cardiac protein structure and function. These changes contribute to the emerging evidence for considerable sexual dimorphism in the cardiac aging process that should be considered in the search for disease mechanisms.NEW & NOTEWORTHY Han et al. used proteogenomics to compare male and female mouse hearts at 4 and 20 mo. Sex-biased cardiac genes function in mitochondrial metabolism, translation, autophagy, and other processes. Hundreds of cardiac genes show sex-by-age interactions, that is, sex-biased aging genes. Cardiac aging is accompanied with a remodeling of exon usage in functionally coordinated genes, concomitant with differential expression of RNA-binding proteins and splice factors. These features represent an underinvestigated aspect of cardiac aging that may be relevant to the search for disease mechanisms.
Collapse
Grants
- R21-HL150456 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R00-HL144829 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R00 HL127302 NHLBI NIH HHS
- R03-OD032666 HHS | NIH | NIH Office of the Director (OD)
- R01 HL141278 NHLBI NIH HHS
- F32 HL149191 NHLBI NIH HHS
- F32-HL149191 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R00-HL127302 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R21 HL150456 NHLBI NIH HHS
- R03 OD032666 NIH HHS
- R00 HL144829 NHLBI NIH HHS
- R01-HL141278 HHS | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- University of Colorado
- University of Colorado School of Medicine, Anschutz Medical Campus
Collapse
Affiliation(s)
- Yu Han
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | - Sara A Wennersten
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | - Julianna M Wright
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | - R W Ludwig
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| | | | - Maggie P Y Lam
- Department of Medicine, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
- Department of Biochemistry and Molecular Genetics, Anschutz Medical Campus, University of Colorado School of Medicine, Aurora, Colorado
| |
Collapse
|
17
|
Harmonizing Labeling and Analytical Strategies to Obtain Protein Turnover Rates in Intact Adult Animals. Mol Cell Proteomics 2022; 21:100252. [PMID: 35636728 PMCID: PMC9249856 DOI: 10.1016/j.mcpro.2022.100252] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 03/29/2022] [Accepted: 05/25/2022] [Indexed: 12/02/2022] Open
Abstract
Changes in the abundance of individual proteins in the proteome can be elicited by modulation of protein synthesis (the rate of input of newly synthesized proteins into the protein pool) or degradation (the rate of removal of protein molecules from the pool). A full understanding of proteome changes therefore requires a definition of the roles of these two processes in proteostasis, collectively known as protein turnover. Because protein turnover occurs even in the absence of overt changes in pool abundance, turnover measurements necessitate monitoring the flux of stable isotope–labeled precursors through the protein pool such as labeled amino acids or metabolic precursors such as ammonium chloride or heavy water. In cells in culture, the ability to manipulate precursor pools by rapid medium changes is simple, but for more complex systems such as intact animals, the approach becomes more convoluted. Individual methods bring specific complications, and the suitability of different methods has not been comprehensively explored. In this study, we compare the turnover rates of proteins across four mouse tissues, obtained from the same inbred mouse strain maintained under identical husbandry conditions, measured using either [13C6]lysine or [2H2]O as the labeling precursor. We show that for long-lived proteins, the two approaches yield essentially identical measures of the first-order rate constant for degradation. For short-lived proteins, there is a need to compensate for the slower equilibration of lysine through the precursor pools. We evaluate different approaches to provide that compensation. We conclude that both labels are suitable, but careful determination of precursor enrichment kinetics in amino acid labeling is critical and has a considerable influence on the numerical values of the derived protein turnover rates. Controlled comparison of heavy water or amino acid labeling for protein turnover. Delays in amino acid precursor labeling mostly affect high turnover proteins Both methods produced similar turnover rates after adjustment of precursor kinetics. Recommendations for analytical workflows for protein turnover studies in animals.
Collapse
|
18
|
Schessner JP, Voytik E, Bludau I. A practical guide to interpreting and generating bottom-up proteomics data visualizations. Proteomics 2022; 22:e2100103. [PMID: 35107884 DOI: 10.1002/pmic.202100103] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 12/22/2021] [Accepted: 01/20/2022] [Indexed: 11/10/2022]
Abstract
Mass-spectrometry based bottom-up proteomics is the main method to analyze proteomes comprehensively and the rapid evolution of instrumentation and data analysis has made the technology widely available. Data visualization is an integral part of the analysis process and it is crucial for the communication of results. This is a major challenge due to the immense complexity of MS data. In this review, we provide an overview of commonly used visualizations, starting with raw data of traditional and novel MS technologies, then basic peptide and protein level analyses, and finally visualization of highly complex datasets and networks. We specifically provide guidance on how to critically interpret and discuss the multitude of different proteomics data visualizations. Furthermore, we highlight Python-based libraries and other open science tools that can be applied for independent and transparent generation of customized visualizations. To further encourage programmatic data visualization, we provide the Python code used to generate all data Figures in this review on GitHub (https://github.com/MannLabs/ProteomicsVisualization). This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Julia Patricia Schessner
- Max-Planck-Institute of Biochemistry, Department of Proteomics and Signal Transduction, Planegg, Germany
| | - Eugenia Voytik
- Max-Planck-Institute of Biochemistry, Department of Proteomics and Signal Transduction, Planegg, Germany
| | - Isabell Bludau
- Max-Planck-Institute of Biochemistry, Department of Proteomics and Signal Transduction, Planegg, Germany
| |
Collapse
|
19
|
Wang Y, Xia B, Deng S, Ye Y, Zhou Y. Performing 2D-1D-2D Mass Spectrometry Imaging Using Strings. Anal Chem 2022; 94:1661-1668. [PMID: 35029371 DOI: 10.1021/acs.analchem.1c04181] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The mass spectrometry imaging (MSI) technique is widely used in several fields due to its ability to provide spatial information of samples. However, for existing MSI methods, the sample is typically placed on a two-dimensional (2D) platform and is scanned back and forth. As a result, the platform size limits the imaging size. This paper proposes a new MSI method that involves the initial imprinting of chemicals on a two-dimensional string plane area. The string plane was then unraveled to a one-dimensional (1D) string, and the chemicals imprinted on it were ionized using a lab-made ion source. Finally, a 2D MSI image was reconstructed through data processing (2D-1D-2D mass imaging). Compared with traditional MSI methods, the imaging size is no longer limited by the platform size, making it possible to perform the MSI of large samples. As proof of concept, this method was used to image an intact seedling of Broussonetia papyrifera. As a result, clear and overall MS images were obtained, demonstrating the ability of this method to analyze large samples.
Collapse
Affiliation(s)
- Yu Wang
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan 610041, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bing Xia
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan 610041, China
| | - Shunyan Deng
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan 610041, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ye Ye
- Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Yan Zhou
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, Sichuan 610041, China
| |
Collapse
|
20
|
Schulze S, Pohlschroder M. Proteomic Sample Preparation and Data Analysis in Line with the Archaeal Proteome Project. Methods Mol Biol 2022; 2522:287-300. [PMID: 36125757 DOI: 10.1007/978-1-0716-2445-6_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Despite the ecological, evolutionary and economical significance of archaea, key aspects of their cell biology, metabolic pathways, and adaptations to a wide spectrum of environmental conditions, remain to be elucidated. Proteomics allows for the system-wide analysis of proteins, their changes in abundance between different conditions, as well as their post-translational modifications, providing detailed insights into the function of proteins and archaeal cell biology. In this chapter, we describe a sample preparation and mass spectrometric analysis workflow that has been designed for Haloferax volcanii but can be applied to a broad range of archaeal species. Furthermore, proteomics experiments provide a wealth of data that is invaluable to various disciplines. Therefore, we previously initiated the Archaeal Proteome Project (ArcPP), a community project that combines the analysis of multiple datasets with expert knowledge in various fields of archaeal research. The corresponding bioinformatic analysis, allowing for the integration of new proteomics data into the ArcPP, as well as the interactive exploration of ArcPP results is also presented here. In combination, these protocols facilitate an optimized, detailed and collaborative approach to archaeal proteomics.
Collapse
Affiliation(s)
- Stefan Schulze
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA.
| | | |
Collapse
|
21
|
Huang J, Jiang B, Liu M, Yang P, Cao W. gQuant, an Automated Tool for Quantitative Glycomic Data Analysis. Front Chem 2021; 9:707738. [PMID: 34395380 PMCID: PMC8355585 DOI: 10.3389/fchem.2021.707738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 07/15/2021] [Indexed: 11/25/2022] Open
Abstract
MALDI-MS-based glycan isotope labeling methods have been effectively and widely used for quantitative glycomics. However, interpretation of the data produced by MALDI-MS is inaccurate and tedious because the bioinformatic tools are inadequate. In this work, we present gQuant, an automated tool for MALDI-MS-based glycan isotope labeling data processing. gQuant was designed with a set of dedicated algorithms to improve the efficiency, accuracy and convenience of quantitation data processing. When tested on the reference data set, gQuant showed a fast processing speed, as it was able to search the glycan data of model glycoproteins in a few minutes and reported more results than the manual analysis did. The reported quantitation ratios matched well with the experimental glycan mixture ratios ranging from 1:10 to 10:1. In addition, gQuant is fully open-source and is coded in Python, which is supported by most operating systems, and it has a user-friendly interface. gQuant can be easily adapted by users for specific experimental designs, such as specific glycan databases, different derivatization types and relative quantitation designs and can thus facilitate fast glycomic quantitation for clinical sample analysis using MALDI-MS-based stable isotope labeling.
Collapse
Affiliation(s)
- Jiangming Huang
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China.,Department of Chemistry, Fudan University, Shanghai, China
| | - Biyun Jiang
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Mingqi Liu
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Pengyuan Yang
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China.,Department of Chemistry, Fudan University, Shanghai, China
| | - Weiqian Cao
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China.,NHC Key Laboratory of Glycoconjugates Research, Fudan University, Shanghai, China
| |
Collapse
|
22
|
Han S, Van Treuren W, Fischer CR, Merrill BD, DeFelice BC, Sanchez JM, Higginbottom SK, Guthrie L, Fall LA, Dodd D, Fischbach MA, Sonnenburg JL. A metabolomics pipeline for the mechanistic interrogation of the gut microbiome. Nature 2021; 595:415-420. [PMID: 34262212 PMCID: PMC8939302 DOI: 10.1038/s41586-021-03707-9] [Citation(s) in RCA: 156] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 06/08/2021] [Indexed: 02/06/2023]
Abstract
Gut microorganisms modulate host phenotypes and are associated with numerous health effects in humans, ranging from host responses to cancer immunotherapy to metabolic disease and obesity. However, difficulty in accurate and high-throughput functional analysis of human gut microorganisms has hindered efforts to define mechanistic connections between individual microbial strains and host phenotypes. One key way in which the gut microbiome influences host physiology is through the production of small molecules1-3, yet progress in elucidating this chemical interplay has been hindered by limited tools calibrated to detect the products of anaerobic biochemistry in the gut. Here we construct a microbiome-focused, integrated mass-spectrometry pipeline to accelerate the identification of microbiota-dependent metabolites in diverse sample types. We report the metabolic profiles of 178 gut microorganism strains using our library of 833 metabolites. Using this metabolomics resource, we establish deviations in the relationships between phylogeny and metabolism, use machine learning to discover a previously undescribed type of metabolism in Bacteroides, and reveal candidate biochemical pathways using comparative genomics. Microbiota-dependent metabolites can be detected in diverse biological fluids from gnotobiotic and conventionally colonized mice and traced back to the corresponding metabolomic profiles of cultured bacteria. Collectively, our microbiome-focused metabolomics pipeline and interactive metabolomics profile explorer are a powerful tool for characterizing microorganisms and interactions between microorganisms and their host.
Collapse
Affiliation(s)
- Shuo Han
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Will Van Treuren
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA,Microbiology and Immunology Graduate Program, Stanford University School of Medicine, Stanford, CA, USA
| | - Curt R. Fischer
- ChEM-H, Stanford University, Stanford, CA, USA,Chan-Zuckerburg Biohub, San Francisco, CA, USA
| | - Bryan D. Merrill
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA,Microbiology and Immunology Graduate Program, Stanford University School of Medicine, Stanford, CA, USA
| | | | | | - Steven K. Higginbottom
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Leah Guthrie
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Lalla A. Fall
- ChEM-H, Stanford University, Stanford, CA, USA,Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Dylan Dodd
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA,Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA,Address correspondence to: , ,
| | - Michael A. Fischbach
- Chan-Zuckerburg Biohub, San Francisco, CA, USA,Department of Bioengineering, Stanford University, Stanford, CA, USA,Address correspondence to: , ,
| | - Justin L. Sonnenburg
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA,Chan-Zuckerburg Biohub, San Francisco, CA, USA,Center for Human Microbiome Studies, Stanford, CA, USA,Address correspondence to: , ,
| |
Collapse
|
23
|
Comprehensive glycoproteomics shines new light on the complexity and extent of glycosylation in archaea. PLoS Biol 2021; 19:e3001277. [PMID: 34138841 PMCID: PMC8241124 DOI: 10.1371/journal.pbio.3001277] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 06/29/2021] [Accepted: 05/10/2021] [Indexed: 12/19/2022] Open
Abstract
Glycosylation is one of the most complex posttranslational protein modifications. Its importance has been established not only for eukaryotes but also for a variety of prokaryotic cellular processes, such as biofilm formation, motility, and mating. However, comprehensive glycoproteomic analyses are largely missing in prokaryotes. Here, we extend the phenotypic characterization of N-glycosylation pathway mutants in Haloferax volcanii and provide a detailed glycoproteome for this model archaeon through the mass spectrometric analysis of intact glycopeptides. Using in-depth glycoproteomic datasets generated for the wild-type (WT) and mutant strains as well as a reanalysis of datasets within the Archaeal Proteome Project (ArcPP), we identify the largest archaeal glycoproteome described so far. We further show that different N-glycosylation pathways can modify the same glycosites under the same culture conditions. The extent and complexity of the Hfx. volcanii N-glycoproteome revealed here provide new insights into the roles of N-glycosylation in archaeal cell biology. A comprehensive glycoproteomic analysis of Haloferax volcanii reveals the extent and complexity of glycosylation in archaea and provides new insights into the roles of this post-translational modification in various cellular processes, including cell shape determination.
Collapse
|
24
|
Huang YC, Tremouilhac P, Nguyen A, Jung N, Bräse S. ChemSpectra: a web-based spectra editor for analytical data. J Cheminform 2021; 13:8. [PMID: 33568182 PMCID: PMC7877097 DOI: 10.1186/s13321-020-00481-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 12/15/2020] [Indexed: 11/26/2022] Open
Abstract
ChemSpectra, a web-based software to visualize and analyze spectroscopic data, integrating solutions for infrared spectroscopy (IR), mass spectrometry (MS), and one-dimensional 1H and 13C NMR (proton and carbon nuclear magnetic resonance) spectroscopy, is described. ChemSpectra serves as web-based tool for the analysis of the most often used types of one-dimensional spectroscopic data in synthetic (organic) chemistry research. It was developed to support in particular processes for the use of open file formats which enable the work according to the FAIR data principles. The software can deal with the open file formats JCAMP-DX (IR, MS, NMR) and mzML (MS) proposing these data file types to gain interoperable data. ChemSpectra can be extended to read also other formats as exemplified by selected proprietary mass spectrometry data files of type RAW and NMR spectra files of type FID. The JavaScript-based editor can be integrated with other software, as demonstrated by integration into the Chemotion electronic lab notebook (ELN) and Chemotion repository, demonstrating the implementation into a digital work environment that offers additional functionality and sustainable research data management options. ChemSpectra supports different functions for working with spectroscopic data such as zoom functions, peak picking and automatic peak detection according to a default or manually defined threshold. NMR specific functions include the definition of a reference signal, the integration of signals, coupling constant calculation and multiplicity assignment. Embedded into a web application such as an ELN or a repository, the editor can also be used to generate an association of spectra to a sample and a file management. The file management supports the storage of the original spectra along with the last edited version and an automatically generated image of the spectra in png format. To maximize the benefit of the spectra editor for e.g. ELN users, an automated procedure for the transfer of the detected or manually chosen signals to the ELN was implemented. ChemSpectra is released under the AGPL license to encourage its re-use and further developments by the community.
Collapse
Affiliation(s)
- Yu-Chieh Huang
- Institute of Biological and Chemical Systems-Functional Molecular Systems (IBCS-FMS), Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Pierre Tremouilhac
- Institute of Biological and Chemical Systems-Functional Molecular Systems (IBCS-FMS), Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - An Nguyen
- Institute of Biological and Chemical Systems-Functional Molecular Systems (IBCS-FMS), Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Nicole Jung
- Institute of Biological and Chemical Systems-Functional Molecular Systems (IBCS-FMS), Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany. .,Institute of Organic Chemistry, Karlsruhe Institute of Technology, Fritz-Haber-Weg 6, 76131, Karlsruhe, Germany.
| | - Stefan Bräse
- Institute of Biological and Chemical Systems-Functional Molecular Systems (IBCS-FMS), Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany. .,Institute of Organic Chemistry, Karlsruhe Institute of Technology, Fritz-Haber-Weg 6, 76131, Karlsruhe, Germany.
| |
Collapse
|
25
|
Schulze S, Igiraneza AB, Kösters M, Leufken J, Leidel SA, Garcia BA, Fufezan C, Pohlschroder M. Enhancing Open Modification Searches via a Combined Approach Facilitated by Ursgal. J Proteome Res 2021; 20:1986-1996. [PMID: 33514075 DOI: 10.1021/acs.jproteome.0c00799] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The identification of peptide sequences and their post-translational modifications (PTMs) is a crucial step in the analysis of bottom-up proteomics data. The recent development of open modification search (OMS) engines allows virtually all PTMs to be searched for. This not only increases the number of spectra that can be matched to peptides but also greatly advances the understanding of the biological roles of PTMs through the identification, and the thereby facilitated quantification, of peptidoforms (peptide sequences and their potential PTMs). Whereas the benefits of combining results from multiple protein database search engines have been previously established, similar approaches for OMS results have been missing so far. Here we compare and combine results from three different OMS engines, demonstrating an increase in peptide spectrum matches of 8-18%. The unification of search results furthermore allows for the combined downstream processing of search results, including the mapping to potential PTMs. Finally, we test for the ability of OMS engines to identify glycosylated peptides. The implementation of these engines in the Python framework Ursgal facilitates the straightforward application of the OMS with unified parameters and results files, thereby enabling yet unmatched high-throughput, large-scale data analysis.
Collapse
Affiliation(s)
- Stefan Schulze
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Aime Bienfait Igiraneza
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Manuel Kösters
- Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland
| | - Johannes Leufken
- Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland
| | - Sebastian A Leidel
- Department of Chemistry and Biochemistry, University of Bern, 3012 Bern, Switzerland
| | - Benjamin A Garcia
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Christian Fufezan
- Institute of Pharmacy and Molecular Biotechnology, Heidelberg University, 69120 Heidelberg, Germany
| | - Mechthild Pohlschroder
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
26
|
Rusconi F. Free Open Source Software for Protein and Peptide Mass Spectrometry- based Science. Curr Protein Pept Sci 2021; 22:134-147. [PMID: 33461461 DOI: 10.2174/1389203722666210118160946] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Revised: 10/12/2020] [Accepted: 01/04/2021] [Indexed: 12/28/2022]
Abstract
In the field of biology, and specifically in protein and peptide science, the power of mass spectrometry is that it is applicable to a vast spectrum of applications. Mass spectrometry can be applied to identify proteins and peptides in complex mixtures, to identify and locate post-translational modifications, to characterize the structure of proteins and peptides to the most detailed level or to detect protein-ligand non-covalent interactions. Thanks to the Free and Open Source Software (FOSS) movement, scientists have limitless opportunities to deepen their skills in software development to code software that solves mass spectrometric data analysis problems. After the conversion of raw data files into open standard format files, the entire spectrum of data analysis tasks can now be performed integrally on FOSS platforms, like GNU/Linux, and only with FOSS solutions. This review presents a brief history of mass spectrometry open file formats and goes on with the description of FOSS projects that are commonly used in protein and peptide mass spectrometry fields of endeavor: identification projects that involve mostly automated pipelines, like proteomics and peptidomics, and bio-structural characterization projects that most often involve manual scrutiny of the mass data. Projects of the last kind usually involve software that allows the user to delve into the mass data in an interactive graphics-oriented manner. Software projects are thus categorized on the basis of these criteria: software libraries for software developers vs desktop-based graphical user interface, software for the end-user and automated pipeline-based data processing vs interactive graphics-based mass data scrutiny.
Collapse
Affiliation(s)
- Filippo Rusconi
- PAPPSO, Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| |
Collapse
|
27
|
Schulze S, Oltmanns A, Fufezan C, Krägenbring J, Mormann M, Pohlschröder M, Hippler M. SugarPy facilitates the universal, discovery-driven analysis of intact glycopeptides. Bioinformatics 2020; 36:5330-5336. [PMID: 33325487 DOI: 10.1093/bioinformatics/btaa1042] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 10/26/2020] [Accepted: 12/07/2020] [Indexed: 02/07/2023] Open
Abstract
MOTIVATION Protein glycosylation is a complex post-translational modification with crucial cellular functions in all domains of life. Currently, large-scale glycoproteomics approaches rely on glycan database dependent algorithms and are thus unsuitable for discovery-driven analyses of glycoproteomes. RESULTS Therefore, we devised SugarPy, a glycan database independent Python module, and validated it on the glycoproteome of human breast milk. We further demonstrated its applicability by analyzing glycoproteomes with uncommon glycans stemming from the green alga Chlamydomonas reinhardtii and the archaeon Haloferax volcanii. SugarPy also facilitated the novel characterization of glycoproteins from the red alga Cyanidioschyzon merolae. AVAILABILITY The source code is freely available on GitHub (https://github.com/SugarPy/SugarPy), and its implementation in Python ensures support for all operating systems. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Stefan Schulze
- University of Pennsylvania, Department of Biology, Leidy Laboratories, Philadelphia, USA.,University of Muenster, Institute of Plant Biology and Biotechnology, Muenster, Germany
| | - Anne Oltmanns
- University of Muenster, Institute of Plant Biology and Biotechnology, Muenster, Germany
| | - Christian Fufezan
- University of Muenster, Institute of Plant Biology and Biotechnology, Muenster, Germany.,Heidelberg University, Institute of Pharmacy and Molecular Biotechnology, Heidelberg, Germany
| | - Julia Krägenbring
- University of Muenster, Institute of Plant Biology and Biotechnology, Muenster, Germany.,University of Muenster, Institute for Hygiene, Muenster, Germany
| | - Michael Mormann
- University of Muenster, Institute for Hygiene, Muenster, Germany
| | - Mechthild Pohlschröder
- University of Pennsylvania, Department of Biology, Leidy Laboratories, Philadelphia, USA
| | - Michael Hippler
- University of Muenster, Institute of Plant Biology and Biotechnology, Muenster, Germany.,Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
| |
Collapse
|
28
|
Dostal V, Wood SD, Thomas CT, Han Y, Lau E, Lam MPY. Proteomic signatures of acute oxidative stress response to paraquat in the mouse heart. Sci Rep 2020; 10:18440. [PMID: 33116222 PMCID: PMC7595225 DOI: 10.1038/s41598-020-75505-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 10/15/2020] [Indexed: 01/11/2023] Open
Abstract
The heart is sensitive to oxidative damage but a global view on how the cardiac proteome responds to oxidative stressors has yet to fully emerge. Using quantitative tandem mass spectrometry, we assessed the effects of acute exposure of the oxidative stress inducer paraquat on protein expression in mouse hearts. We observed widespread protein expression changes in the paraquat-exposed heart especially in organelle-containing subcellular fractions. During cardiac response to acute oxidative stress, proteome changes are consistent with a rapid reduction of mitochondrial metabolism, coupled with activation of multiple antioxidant proteins, reduction of protein synthesis and remediation of proteostasis. In addition to differential expression, we saw evidence of spatial reorganizations of the cardiac proteome including the translocation of hexokinase 2 to more soluble fractions. Treatment with the antioxidants Tempol and MitoTEMPO reversed many proteomic signatures of paraquat but this reversal was incomplete. We also identified a number of proteins with unknown function in the heart to be triggered by paraquat, suggesting they may have functions in oxidative stress response. Surprisingly, protein expression changes in the heart correlate poorly with those in the lung, consistent with differential sensitivity or stress response in these two organs. The results and data set here could provide insights into oxidative stress responses in the heart and avail the search for new therapeutic targets.
Collapse
Affiliation(s)
- Vishantie Dostal
- Department of Medicine, Division of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Consortium for Fibrosis Research & Translation, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Silas D Wood
- Department of Medicine, Division of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Consortium for Fibrosis Research & Translation, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Cody T Thomas
- Department of Medicine, Division of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Yu Han
- Department of Medicine, Division of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Consortium for Fibrosis Research & Translation, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Edward Lau
- Department of Medicine, Division of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Department of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.,Consortium for Fibrosis Research & Translation, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Maggie P Y Lam
- Department of Medicine, Division of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA. .,Department of Cardiology, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA. .,Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA. .,Consortium for Fibrosis Research & Translation, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.
| |
Collapse
|
29
|
Lee S, Hwang S, Seo M, Shin KB, Kim KH, Park GW, Kim JY, Yoo JS, No KT. BMDMS-NP: A comprehensive ESI-MS/MS spectral library of natural compounds. PHYTOCHEMISTRY 2020; 177:112427. [PMID: 32535345 DOI: 10.1016/j.phytochem.2020.112427] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 05/26/2020] [Accepted: 05/26/2020] [Indexed: 06/11/2023]
Abstract
The Bioinformatics & Molecular Design Research Center Mass Spectral Library - Natural Products (BMDMS-NP) is a library containing the mass spectra of natural compounds, especially plant specialized metabolites. At present, the library contains the electrospray ionization tandem mass spectrometry (ESI-MS/MS) spectra of 2739 plant metabolites that are commercially available. The contents of the library were made comprehensive by incorporating data generated under various experimental conditions for compounds with diverse molecular structures. The structural diversity of the BMDMS-NP data was evaluated using molecular fingerprints, and it was sufficiently exhaustive enough to represent the structures of the natural products commercially available. The MS/MS spectra of each metabolite were obtained with different types/brands of ion traps (tandem-in-time) or combinations of mass analyzers (tandem-in-space) at multiple collision energies. All spectra were measured repeatedly in each environment because variations can occur in spectra, even under the same conditions. Moreover, the probability, separability of searching, and transferability of this spectral library were evaluated against those of MS/MS libraries, namely: NIST17 and MoNA.
Collapse
Affiliation(s)
- Sangwon Lee
- Department of Biotechnology, Yonsei University, Seoul, Republic of Korea; Bioinformatics & Molecular Design Research Center, Incheon, Republic of Korea
| | - Sungbo Hwang
- Department of Biotechnology, Yonsei University, Seoul, Republic of Korea; Bioinformatics & Molecular Design Research Center, Incheon, Republic of Korea
| | - Myungwon Seo
- Department of Biotechnology, Yonsei University, Seoul, Republic of Korea; Bioinformatics & Molecular Design Research Center, Incheon, Republic of Korea
| | - Ki Beom Shin
- Department of Biotechnology, Yonsei University, Seoul, Republic of Korea; Bioinformatics & Molecular Design Research Center, Incheon, Republic of Korea
| | - Kwang Hoe Kim
- Research Center of Bioconvergence Analysis, Korea Basic Science Institute, Ochang, Republic of Korea
| | - Gun Wook Park
- Research Center of Bioconvergence Analysis, Korea Basic Science Institute, Ochang, Republic of Korea
| | - Jin Young Kim
- Research Center of Bioconvergence Analysis, Korea Basic Science Institute, Ochang, Republic of Korea
| | - Jong Shin Yoo
- Research Center of Bioconvergence Analysis, Korea Basic Science Institute, Ochang, Republic of Korea; Graduate School of Analytical Science and Technology, Chungnam National University, Daejeon, Republic of Korea
| | - Kyoung Tai No
- Department of Biotechnology, Yonsei University, Seoul, Republic of Korea; Bioinformatics & Molecular Design Research Center, Incheon, Republic of Korea.
| |
Collapse
|
30
|
Xue P, Si T, Mishra S, Zhang L, Choe K, Sweedler JV, Zhao H. A mass spectrometry-based high-throughput screening method for engineering fatty acid synthases with improved production of medium-chain fatty acids. Biotechnol Bioeng 2020; 117:2131-2138. [PMID: 32219854 DOI: 10.1002/bit.27343] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 03/07/2020] [Accepted: 03/22/2020] [Indexed: 01/09/2023]
Abstract
Microbial cell factories have been extensively engineered to produce free fatty acids (FFAs) as key components of crucial nutrients, soaps, industrial chemicals, and fuels. However, our ability to control the composition of microbially synthesized FFAs is still limited, particularly, for producing medium-chain fatty acids (MCFAs). This is mainly due to the lack of high-throughput approaches for FFA analysis to engineer enzymes with desirable product specificity. Here we report a mass spectrometry (MS)-based method for rapid profiling of MCFAs in Saccharomyces cerevisiae by using membrane lipids as a proxy. In particular, matrix-assisted laser desorption/ionization time-of-flight (MALDI-ToF) MS was used to detect shorter acyl chain phosphatidylcholines from membrane lipids and a higher m/z peak ratio at 730 and 758 was used as an indication for improved MCFA production. This colony-based method can be performed at a rate of ~2 s per sample, representing a substantial improvement over gas chromatography-MS (typically >30 min per sample) as the gold standard method for FFA detection. To demonstrate the power of this method, we performed site-saturation mutagenesis of the yeast fatty acid synthase and identified nine missense mutations that resulted in improved MCFA production relative to the wild-type strain. Colony-based MALDI-ToF MS screening provides an effective approach for engineering microbial fatty acid compositions in a high-throughput manner.
Collapse
Affiliation(s)
- Pu Xue
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Tong Si
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Shekhar Mishra
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Linzixuan Zhang
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Kisurb Choe
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Jonathan V Sweedler
- DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois
| |
Collapse
|
31
|
Bittremieux W. spectrum_utils: A Python Package for Mass Spectrometry Data Processing and Visualization. Anal Chem 2019; 92:659-661. [PMID: 31809021 DOI: 10.1021/acs.analchem.9b04884] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Given the wide diversity in applications of biological mass spectrometry, custom data analyses are often needed to fully interpret the results of an experiment. Such bioinformatics scripts necessarily include similar basic functionality to read mass spectral data from standard file formats, process it, and visualize it. Rather than having to reimplement this functionality, to facilitate this task, spectrum_utils is a Python package for mass spectrometry data processing and visualization. Its high-level functionality enables developers to quickly prototype ideas for computational mass spectrometry projects in only a few lines of code. Notably, the data processing functionality is highly optimized for computational efficiency to be able to deal with the large volumes of data that are generated during mass spectrometry experiments. The visualization functionality makes it possible to easily produce publication-quality figures as well as interactive spectrum plots for inclusion on web pages. spectrum_utils is available for Python 3.6+, includes extensive online documentation and examples, and can be easily installed using conda. It is freely available as open source under the Apache 2.0 license at https://github.com/bittremieux/spectrum_utils .
Collapse
Affiliation(s)
- Wout Bittremieux
- Skaggs School of Pharmacy and Pharmaceutical Sciences , University of California San Diego , La Jolla , California 92093 , United States.,Department of Mathematics and Computer Science , University of Antwerp , 2020 Antwerp , Belgium.,Biomedical Informatics Network Antwerpen (biomina) , 2020 Antwerp , Belgium
| |
Collapse
|
32
|
Melnikov AD, Tsentalovich YP, Yanshole VV. Deep Learning for the Precise Peak Detection in High-Resolution LC–MS Data. Anal Chem 2019; 92:588-592. [DOI: 10.1021/acs.analchem.9b04811] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Arsenty D. Melnikov
- International Tomography Center SB RAS, Institutskaya 3a, Novosibirsk 630090, Russia
- Novosibirsk State University, Pirogova 2, Novosibirsk 630090, Russia
| | - Yuri P. Tsentalovich
- International Tomography Center SB RAS, Institutskaya 3a, Novosibirsk 630090, Russia
- Novosibirsk State University, Pirogova 2, Novosibirsk 630090, Russia
| | - Vadim V. Yanshole
- International Tomography Center SB RAS, Institutskaya 3a, Novosibirsk 630090, Russia
- Novosibirsk State University, Pirogova 2, Novosibirsk 630090, Russia
| |
Collapse
|
33
|
Proof of concept for identifying cystic fibrosis from perspiration samples. Proc Natl Acad Sci U S A 2019; 116:24408-24412. [PMID: 31740593 DOI: 10.1073/pnas.1909630116] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The gold standard for cystic fibrosis (CF) diagnosis is the determination of chloride concentration in sweat. Current testing methodology takes up to 3 h to complete and has recognized shortcomings on its diagnostic accuracy. We present an alternative method for the identification of CF by combining desorption electrospray ionization mass spectrometry and a machine-learning algorithm based on gradient boosted decision trees to analyze perspiration samples. This process takes as little as 2 min, and we determined its accuracy to be 98 ± 2% by cross-validation on analyzing 277 perspiration samples. With the introduction of statistical bootstrap, our method can provide a confidence estimate of our prediction, which helps diagnosis decision-making. We also identified important peaks by the feature selection algorithm and assigned the chemical structure of the metabolites by high-resolution and/or tandem mass spectrometry. We inspected the correlation between mild and severe CFTR gene mutation types and lipid profiles, suggesting a possible way to realize personalized medicine with this noninvasive, fast, and accurate method.
Collapse
|
34
|
Wandy J, Davies V, J J van der Hooft J, Weidt S, Daly R, Rogers S. In Silico Optimization of Mass Spectrometry Fragmentation Strategies in Metabolomics. Metabolites 2019; 9:E219. [PMID: 31600991 PMCID: PMC6836109 DOI: 10.3390/metabo9100219] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Revised: 09/27/2019] [Accepted: 10/02/2019] [Indexed: 12/20/2022] Open
Abstract
Liquid chromatography (LC) coupled to tandem mass spectrometry (MS/MS) is widely used in identifying small molecules in untargeted metabolomics. Various strategies exist to acquire MS/MS fragmentation spectra; however, the development of new acquisition strategies is hampered by the lack of simulators that let researchers prototype, compare, and optimize strategies before validations on real machines. We introduce Virtual Metabolomics Mass Spectrometer (ViMMS), a metabolomics LC-MS/MS simulator framework that allows for scan-level control of the MS2 acquisition process in silico. ViMMS can generate new LC-MS/MS data based on empirical data or virtually re-run a previous LC-MS/MS analysis using pre-existing data to allow the testing of different fragmentation strategies. To demonstrate its utility, we show how ViMMS can be used to optimize N for Top-N data-dependent acquisition (DDA) acquisition, giving results comparable to modifying N on the mass spectrometer. We expect that ViMMS will save method development time by allowing for offline evaluation of novel fragmentation strategies and optimization of the fragmentation strategy for a particular experiment.
Collapse
Affiliation(s)
- Joe Wandy
- Glasgow Polyomics, University of Glasgow, Glasgow G61 1BD, UK.
| | - Vinny Davies
- School of Computing Science, University of Glasgow, Glasgow G12 8RZ, UK.
| | - Justin J J van der Hooft
- Bioinformatics Group, Department of Plant Sciences, Wageningen University, 6780 PB Wageningen, The Netherlands.
| | - Stefan Weidt
- Glasgow Polyomics, University of Glasgow, Glasgow G61 1BD, UK.
| | - Rónán Daly
- Glasgow Polyomics, University of Glasgow, Glasgow G61 1BD, UK.
| | - Simon Rogers
- School of Computing Science, University of Glasgow, Glasgow G12 8RZ, UK.
| |
Collapse
|
35
|
Abstract
Glycoinformatics is a critical resource for the study of glycobiology, and glycobiology is a necessary component for understanding the complex interface between intra- and extracellular spaces. Despite this, there is limited software available to scientists studying these topics, requiring each to create fundamental data structures and representations anew for each of their applications. This leads to poor uptake of standardization and loss of focus on the real problems. We present glypy, a library written in Python for reading, writing, manipulating, and transforming glycans at several levels of precision. In addition to understanding several common formats for textual representation of glycans, the library also provides application programming interfaces (APIs) for major community databases, including GlyTouCan and UnicarbKB. The library is freely available under the Apache 2 common license with source code available at https://github.com/mobiusklein/ and documentation at https://glypy.readthedocs.io/ .
Collapse
Affiliation(s)
- Joshua Klein
- Program for Bioinformatics , Boston University , Boston , Massachusetts 02215 , United States
| | - Joseph Zaia
- Program for Bioinformatics , Boston University , Boston , Massachusetts 02215 , United States.,Department of Biochemistry , Boston University , Boston , Massachusetts 02215 , United States
| |
Collapse
|
36
|
Klein J, Zaia J. psims - A Declarative Writer for mzML and mzIdentML for Python. Mol Cell Proteomics 2019; 18:571-575. [PMID: 30563850 PMCID: PMC6398200 DOI: 10.1074/mcp.rp118.001070] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 12/12/2018] [Indexed: 01/04/2023] Open
Abstract
mzML and mzIdentML are commonly used, powerful tools for representing mass spectrometry data and derived identification information. These formats are complex, requiring non-trivial logic to translate data into the appropriate representation. Most published implementations are tightly coupled to data structures. The most complete implementations are written in compiled languages that cannot expose the complete flexibility of the implementation to external programs or bindings. To our knowledge, there are no complete implementations for mzML or mzIdentML available to scripting languages like Python or R. We present psims, a library written in Python for writing mzML and mzIdentML. The library allows writing either XML format using built-in Python data structures. It includes a controlled vocabulary resolution system to simplify the encoding process and an identity tracking system to manage entity relationships. The source code is available at https://github.com/mobiusklein/psims, and through the Python Package Index as psims, licensed under the Apache 2 common license.
Collapse
Affiliation(s)
- Joshua Klein
- From the ‡Program for Bioinformatics, Boston University, Boston, Massachusetts 02215
| | - Joseph Zaia
- From the ‡Program for Bioinformatics, Boston University, Boston, Massachusetts 02215;
- §Department of Biochemistry, Boston University, Boston, Massachusetts 02118
| |
Collapse
|
37
|
Levitsky LI, Klein JA, Ivanov MV, Gorshkov MV. Pyteomics 4.0: Five Years of Development of a Python Proteomics Framework. J Proteome Res 2019; 18:709-714. [PMID: 30576148 DOI: 10.1021/acs.jproteome.8b00717] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Many of the novel ideas that drive today's proteomic technologies are focused essentially on experimental or data-processing workflows. The latter are implemented and published in a number of ways, from custom scripts and programs, to projects built using general-purpose or specialized workflow engines; a large part of routine data processing is performed manually or with custom scripts that remain unpublished. Facilitating the development of reproducible data-processing workflows becomes essential for increasing the efficiency of proteomic research. To assist in overcoming the bioinformatics challenges in the daily practice of proteomic laboratories, 5 years ago we developed and announced Pyteomics, a freely available open-source library providing Python interfaces to proteomic data. We summarize the new functionality of Pyteomics developed during the time since its introduction.
Collapse
Affiliation(s)
- Lev I Levitsky
- Moscow Institute of Physics and Technology , Dolgoprudny, Moscow Region 141701 , Russia.,V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| | - Joshua A Klein
- Bioinformatics Program , Boston University , Boston , Massachusetts 02215 , United States
| | - Mark V Ivanov
- V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| | - Mikhail V Gorshkov
- V.L. Talrose Institute for Energy Problems of Chemical Physics , Russian Academy of Sciences , Moscow 119334 , Russia
| |
Collapse
|
38
|
Sarin LP, Kienast SD, Leufken J, Ross RL, Dziergowska A, Debiec K, Sochacka E, Limbach PA, Fufezan C, Drexler HCA, Leidel SA. Nano LC-MS using capillary columns enables accurate quantification of modified ribonucleosides at low femtomol levels. RNA (NEW YORK, N.Y.) 2018; 24:1403-1417. [PMID: 30012570 PMCID: PMC6140458 DOI: 10.1261/rna.065482.117] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2017] [Accepted: 07/11/2018] [Indexed: 05/14/2023]
Abstract
Post-transcriptional chemical modifications of (t)RNA molecules are crucial in fundamental biological processes, such as translation. Despite their biological importance and accumulating evidence linking them to various human diseases, technical challenges have limited their detection and accurate quantification. Here, we present a sensitive capillary nanoflow liquid chromatography mass spectrometry (nLC-MS) pipeline for quantitative high-resolution analysis of ribonucleoside modifications from complex biological samples. We evaluated two porous graphitic carbon (PGC) materials and one end-capped C18 reference material as stationary phases for reversed-phase separation. We found that these matrices have complementing retention and separation characteristics, including the capability to separate structural isomers. PGC and C18 matrices yielded excellent signal-to-noise ratios in nLC-MS while differing in the separation capability and sensitivity for various nucleosides. This emphasizes the need for tailored LC-MS setups for optimally detecting as many nucleoside modifications as possible. Detection ranges spanning up to six orders of magnitude enable the analysis of individual ribonucleosides down to femtomol concentrations. Furthermore, normalizing the obtained signal intensities to a stable isotope labeled spike-in enabled direct comparison of ribonucleoside levels between different samples. In conclusion, capillary columns coupled to nLC-MS constitute a powerful and sensitive tool for quantitative analysis of modified ribonucleosides in complex biological samples. This setup will be invaluable for further unraveling the intriguing and multifaceted biological roles of RNA modifications.
Collapse
Affiliation(s)
- L Peter Sarin
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Muenster, 48149, Germany
- Cells-in-Motion Cluster of Excellence, University of Muenster, Muenster, 48149, Germany
| | - Sandra D Kienast
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Muenster, 48149, Germany
- Cells-in-Motion Cluster of Excellence, University of Muenster, Muenster, 48149, Germany
| | - Johannes Leufken
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Muenster, 48149, Germany
- Cells-in-Motion Cluster of Excellence, University of Muenster, Muenster, 48149, Germany
| | - Robert L Ross
- Rieveschl Laboratories for Mass Spectrometry, Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221-0172, USA
| | - Agnieszka Dziergowska
- Institute of Organic Chemistry, Faculty of Chemistry, Lodz University of Technology, 90-924 Lodz, Poland
| | - Katarzyna Debiec
- Institute of Organic Chemistry, Faculty of Chemistry, Lodz University of Technology, 90-924 Lodz, Poland
| | - Elzbieta Sochacka
- Institute of Organic Chemistry, Faculty of Chemistry, Lodz University of Technology, 90-924 Lodz, Poland
| | - Patrick A Limbach
- Rieveschl Laboratories for Mass Spectrometry, Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221-0172, USA
| | - Christian Fufezan
- Institute of Plant Biology and Biotechnology, University of Muenster, Muenster, 48143, Germany
| | - Hannes C A Drexler
- Bioanalytical Mass Spectrometry Unit, Max Planck Institute for Molecular Biomedicine, Muenster, 48149, Germany
| | - Sebastian A Leidel
- Max Planck Research Group for RNA Biology, Max Planck Institute for Molecular Biomedicine, Muenster, 48149, Germany
- Cells-in-Motion Cluster of Excellence, University of Muenster, Muenster, 48149, Germany
| |
Collapse
|