1
|
Hu M, Yang L, Twarog N, Ochoada J, Li Y, Vrettos EI, Torres-Hernandez AX, Martinez JB, Bhatia J, Young BM, Price J, McGowan K, Nguyen TH, Shi Z, Anyanwu M, Rimmer MA, Mercer S, Rankovic Z, Shelat AA, Blair DJ. Continuous collective analysis of chemical reactions. Nature 2024; 636:374-379. [PMID: 39663496 PMCID: PMC11823688 DOI: 10.1038/s41586-024-08211-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Accepted: 10/14/2024] [Indexed: 12/13/2024]
Abstract
The automated synthesis of small organic molecules from modular building blocks has the potential to transform our capacity to create medicines and materials1-3. Disruptive acceleration of this molecule-building strategy broadly unlocks its functional potential and requires the integration of many new assembly chemistries. Although recent advances in high-throughput chemistry4-6 can speed up the development of appropriate synthetic methods, for example, in selecting appropriate chemical reaction conditions from the vast range of potential options, equivalent high-throughput analytical methods are needed. Here we report a streamlined approach for the rapid, quantitative analysis of chemical reactions by mass spectrometry. The intrinsic fragmentation features of chemical building blocks generalize the analyses of chemical reactions, allowing sub-second readouts of reaction outcomes. Central to this advance was identifying that starting material fragmentation patterns function as universal barcodes for downstream product analysis by mass spectrometry. Combining these features with acoustic droplet ejection mass spectrometry7,8 we could eliminate slow chromatographic steps and continuously evaluate chemical reactions in multiplexed formats. This enabled the assignment of reaction conditions to molecules derived from ultrahigh-throughput chemical synthesis experiments. More generally, these results indicate that fragmentation features inherent to chemical synthesis can empower rapid data-rich experimentation.
Collapse
Affiliation(s)
- Maowei Hu
- Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Lei Yang
- Analytical Technologies Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Nathaniel Twarog
- Lead Discovery Informatics Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Jason Ochoada
- Lead Discovery Informatics Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Yong Li
- Analytical Technologies Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Eirinaios I Vrettos
- Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | | | - James B Martinez
- Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Jiya Bhatia
- Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Brandon M Young
- Medicinal Chemistry Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Jeanine Price
- Medicinal Chemistry Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Kevin McGowan
- Medicinal Chemistry Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Theresa H Nguyen
- Medicinal Chemistry Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Zhe Shi
- Medicinal Chemistry Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Matthew Anyanwu
- Lead Discovery Informatics Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Mary Ashley Rimmer
- Analytical Technologies Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Shea Mercer
- Program Management, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Zoran Rankovic
- Analytical Technologies Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
- Medicinal Chemistry Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Anang A Shelat
- Lead Discovery Informatics Center, Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Daniel J Blair
- Department of Chemical Biology and Therapeutics, St Jude Children's Research Hospital, Memphis, TN, USA.
| |
Collapse
|
2
|
Cho YB, Kim JG, Han JS, An BK, Lee D, Lee MK, Hwang BY. LC-HRMS/MS-Guided Isolation of Unusual Diarylheptanoids from the Rhizomes of Alpinia officinarum. ACS OMEGA 2024; 9:46484-46491. [PMID: 39583693 PMCID: PMC11579737 DOI: 10.1021/acsomega.4c07987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 10/24/2024] [Accepted: 10/29/2024] [Indexed: 11/26/2024]
Abstract
LC-HRMS/MS analysis facilitated the precise targeting, isolation, and identification of unusual dimeric diarylheptanoids from Alpinia officinarum (A. officinarum). The tandem MS data for (4E)-1,7-diphenyl-4-hepten-3-one (7) revealed fragment ions at m/z 91, 105, and 117, which are fragmentation patterns specific to diarylheptanoids. In the tandem MS data, peaks with m/z values ranging from 450 to 600 that exhibited these specific fragment ions were selected and isolated. Consequently, two previously undescribed dimeric diarylheptanoids (1 and 2) and four unusual diarylheptanoids (3-6) along with 10 monomeric diarylheptanoids (7-16) were isolated from the rhizomes of A. officinarum using various chromatographic techniques. The structures of the isolates were elucidated by an analysis of 1D/2D NMR and HRESIMS data, and a combination of DP4+ probability analysis and ECD calculations. To evaluate the anti-inflammatory effects of the isolated compounds, their inhibitory activity against nitric oxide production in LPS-induced RAW 264.7 cells was assessed. Compounds 1, 7, and 9 exhibited remarkable inhibitory effects with IC50 values of 14.7, 6.6, and 5.0 μM, respectively.
Collapse
Affiliation(s)
- Yong Beom Cho
- College
of Pharmacy, Chungbuk National University, Cheongju 28160, Republic of Korea
| | - Jun Gu Kim
- College
of Pharmacy, Chungbuk National University, Cheongju 28160, Republic of Korea
| | - Jae Sang Han
- College
of Pharmacy, Chungbuk National University, Cheongju 28160, Republic of Korea
| | - Beom Kyun An
- College
of Pharmacy, Chungbuk National University, Cheongju 28160, Republic of Korea
| | - Dongho Lee
- Department
of Plant Biotechnology, College of Life Sciences and Biotechnology, Korea University, Seoul 02841, Republic of Korea
| | - Mi Kyeong Lee
- College
of Pharmacy, Chungbuk National University, Cheongju 28160, Republic of Korea
| | - Bang Yeon Hwang
- College
of Pharmacy, Chungbuk National University, Cheongju 28160, Republic of Korea
| |
Collapse
|
3
|
Liang H, Luo Y, van der Donk WA. Substrate Specificity of a Methyltransferase Involved in the Biosynthesis of the Lantibiotic Cacaoidin. Biochemistry 2024; 63:2493-2505. [PMID: 39271288 PMCID: PMC11447909 DOI: 10.1021/acs.biochem.4c00150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Revised: 09/01/2024] [Accepted: 09/06/2024] [Indexed: 09/15/2024]
Abstract
Modification of the N- and C-termini of peptides enhances their stability against degradation by exopeptidases. The biosynthetic pathways of many peptidic natural products feature enzymatic modification of their termini, and these enzymes may represent a valuable pool of biocatalysts. The lantibiotic cacaoidin carries an N,N-dimethylated N-terminal amine group. Its biosynthetic gene cluster encodes the putative methyltransferase Cao4. In this work, we present reconstitution of the activity of the enzyme, which we termed CaoSC following standardized lanthipeptide nomenclature, using a heterologously produced peptide as the model substrate. In vitro methylation of diverse lanthipeptides revealed the substrate requirements of CaoSC. The enzyme accepts peptides of varying lengths and C-terminal sequences but requires dehydroalanine or dehydrobutyrine at the second position. CaoSC-mediated dimethylation of natural lantibiotics resulted in modestly enhanced antimicrobial activity of the lantibiotic haloduracin compared to that of the native compound. Improved activity and/or metabolic stability as a result of methylation illustrates the potential future application of CaoSC in the bioengineering of therapeutic peptides.
Collapse
Affiliation(s)
- Haoqian Liang
- Department
of Biochemistry, University of Illinois
at Urbana−Champaign, Urbana, Illinois 61801, United States
| | - Youran Luo
- Department
of Chemistry and Howard Hughes Medical Institute, University of Illinois at Urbana−Champaign, Urbana, Illinois 61801, United States
| | - Wilfred A. van der Donk
- Department
of Biochemistry, University of Illinois
at Urbana−Champaign, Urbana, Illinois 61801, United States
- Department
of Chemistry and Howard Hughes Medical Institute, University of Illinois at Urbana−Champaign, Urbana, Illinois 61801, United States
- Carl
R. Woese Institute for Genomic Biology, University of Illinois at Urbana−Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
4
|
Pang Z, Lu Y, Zhou G, Hui F, Xu L, Viau C, Spigelman A, MacDonald P, Wishart D, Li S, Xia J. MetaboAnalyst 6.0: towards a unified platform for metabolomics data processing, analysis and interpretation. Nucleic Acids Res 2024; 52:W398-W406. [PMID: 38587201 PMCID: PMC11223798 DOI: 10.1093/nar/gkae253] [Citation(s) in RCA: 379] [Impact Index Per Article: 379.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 03/14/2024] [Accepted: 03/26/2024] [Indexed: 04/09/2024] Open
Abstract
We introduce MetaboAnalyst version 6.0 as a unified platform for processing, analyzing, and interpreting data from targeted as well as untargeted metabolomics studies using liquid chromatography - mass spectrometry (LC-MS). The two main objectives in developing version 6.0 are to support tandem MS (MS2) data processing and annotation, as well as to support the analysis of data from exposomics studies and related experiments. Key features of MetaboAnalyst 6.0 include: (i) a significantly enhanced Spectra Processing module with support for MS2 data and the asari algorithm; (ii) a MS2 Peak Annotation module based on comprehensive MS2 reference databases with fragment-level annotation; (iii) a new Statistical Analysis module dedicated for handling complex study design with multiple factors or phenotypic descriptors; (iv) a Causal Analysis module for estimating metabolite - phenotype causal relations based on two-sample Mendelian randomization, and (v) a Dose-Response Analysis module for benchmark dose calculations. In addition, we have also improved MetaboAnalyst's visualization functions, updated its compound database and metabolite sets, and significantly expanded its pathway analysis support to around 130 species. MetaboAnalyst 6.0 is freely available at https://www.metaboanalyst.ca.
Collapse
Affiliation(s)
- Zhiqiang Pang
- Institute of Parasitology, McGill University,Sainte-Anne-de-Bellevue, Quebec, Canada
| | - Yao Lu
- Department of Microbiology and Immunology, McGill University, Montreal, Quebec, Canada
| | - Guangyan Zhou
- Institute of Parasitology, McGill University,Sainte-Anne-de-Bellevue, Quebec, Canada
| | - Fiona Hui
- Institute of Parasitology, McGill University,Sainte-Anne-de-Bellevue, Quebec, Canada
| | - Lei Xu
- Institute of Parasitology, McGill University,Sainte-Anne-de-Bellevue, Quebec, Canada
| | - Charles Viau
- Institute of Parasitology, McGill University,Sainte-Anne-de-Bellevue, Quebec, Canada
| | - Aliya F Spigelman
- Department of Pharmacology and Alberta Diabetes Institute, University of Alberta, Edmonton, Alberta, Canada
| | - Patrick E MacDonald
- Department of Pharmacology and Alberta Diabetes Institute, University of Alberta, Edmonton, Alberta, Canada
| | - David S Wishart
- Departments of Biological Sciences and Computing Science, University of Alberta, Edmonton, Alberta, Canada
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- University of Connecticut School of Medicine, Farmington, CT, USA
| | - Jianguo Xia
- Institute of Parasitology, McGill University,Sainte-Anne-de-Bellevue, Quebec, Canada
- Department of Microbiology and Immunology, McGill University, Montreal, Quebec, Canada
| |
Collapse
|
5
|
Bui-Thi D, Liu Y, Lippens JL, Laukens K, De Vijlder T. TransExION: a transformer based explainable similarity metric for comparing IONS in tandem mass spectrometry. J Cheminform 2024; 16:61. [PMID: 38807166 PMCID: PMC11134763 DOI: 10.1186/s13321-024-00858-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 05/12/2024] [Indexed: 05/30/2024] Open
Abstract
Small molecule identification is a crucial task in analytical chemistry and life sciences. One of the most commonly used technologies to elucidate small molecule structures is mass spectrometry. Spectral library search of product ion spectra (MS/MS) is a popular strategy to identify or find structural analogues. This approach relies on the assumption that spectral similarity and structural similarity are correlated. However, popular spectral similarity measures, usually calculated based on identical fragment matches between the MS/MS spectra, do not always accurately reflect the structural similarity. In this study, we propose TransExION, a Transformer based Explainable similarity metric for IONS. TransExION detects related fragments between MS/MS spectra through their mass difference and uses these to estimate spectral similarity. These related fragments can be nearly identical, but can also share a substructure. TransExION also provides a post-hoc explanation of its estimation, which can be used to support scientists in evaluating the spectral library search results and thus in structure elucidation of unknown molecules. Our model has a Transformer based architecture and it is trained on the data derived from GNPS MS/MS libraries. The experimental results show that it improves existing spectral similarity measures in searching and interpreting structural analogues as well as in molecular networking. SCIENTIFIC CONTRIBUTION: We propose a transformer-based spectral similarity metrics that improves the comparison of small molecule tandem mass spectra. We provide a post hoc explanation that can serve as a good starting point for unknown spectra annotation based on database spectra.
Collapse
Affiliation(s)
- Danh Bui-Thi
- Computer Science Department, University of Antwerp, Middelheimlaan 1, 2020, Antwerp, Belgium
| | - Youzhong Liu
- Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Jennifer L Lippens
- Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Kris Laukens
- Computer Science Department, University of Antwerp, Middelheimlaan 1, 2020, Antwerp, Belgium
| | - Thomas De Vijlder
- Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium.
| |
Collapse
|
6
|
Pang Z, Xu L, Viau C, Lu Y, Salavati R, Basu N, Xia J. MetaboAnalystR 4.0: a unified LC-MS workflow for global metabolomics. Nat Commun 2024; 15:3675. [PMID: 38693118 PMCID: PMC11063062 DOI: 10.1038/s41467-024-48009-6] [Citation(s) in RCA: 51] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 04/18/2024] [Indexed: 05/03/2024] Open
Abstract
The wide applications of liquid chromatography - mass spectrometry (LC-MS) in untargeted metabolomics demand an easy-to-use, comprehensive computational workflow to support efficient and reproducible data analysis. However, current tools were primarily developed to perform specific tasks in LC-MS based metabolomics data analysis. Here we introduce MetaboAnalystR 4.0 as a streamlined pipeline covering raw spectra processing, compound identification, statistical analysis, and functional interpretation. The key features of MetaboAnalystR 4.0 includes an auto-optimized feature detection and quantification algorithm for LC-MS1 spectra processing, efficient MS2 spectra deconvolution and compound identification for data-dependent or data-independent acquisition, and more accurate functional interpretation through integrated spectral annotation. Comprehensive validation studies using LC-MS1 and MS2 spectra obtained from standards mixtures, dilution series and clinical metabolomics samples have shown its excellent performance across a wide range of common tasks such as peak picking, spectral deconvolution, and compound identification with good computing efficiency. Together with its existing statistical analysis utilities, MetaboAnalystR 4.0 represents a significant step toward a unified, end-to-end workflow for LC-MS based global metabolomics in the open-source R environment.
Collapse
Affiliation(s)
- Zhiqiang Pang
- Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Lei Xu
- Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Charles Viau
- Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Yao Lu
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
| | - Reza Salavati
- Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Niladri Basu
- Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada
| | - Jianguo Xia
- Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada.
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada.
| |
Collapse
|
7
|
Engler Hart C, Kind T, Dorrestein PC, Healey D, Domingo-Fernández D. Weighting Low-Intensity MS/MS Ions and m/ z Frequency for Spectral Library Annotation. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:266-274. [PMID: 38271611 PMCID: PMC10854760 DOI: 10.1021/jasms.3c00353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 12/29/2023] [Accepted: 01/05/2024] [Indexed: 01/27/2024]
Abstract
Calculating spectral similarity is a fundamental step in MS/MS data analysis in untargeted metabolomics experiments, as it facilitates the identification of related spectra and the annotation of compounds. To improve matching accuracy when querying an experimental mass spectrum against a spectral library, previous approaches have proposed increasing peak intensities for high m/z ranges. These high m/z values tend to be smaller in magnitude, yet they offer more crucial information for identifying the chemical structure. Here, we evaluate the impact of using these weights for identifying structurally related compounds and mass spectral library searches. Additionally, we propose a weighting approach that (i) takes into account the frequency of the m/z values within a spectral library in order to assign higher importance to the most common peaks and (ii) increases the intensity of lower peaks, similar to previous approaches. To demonstrate our approach, we applied weighting preprocessing to modified cosine, entropy, and fidelity distance metrics and benchmarked it against previously reported weights. Our results demonstrate how weighting-based preprocessing can assist in annotating the structure of unknown spectra as well as identifying structurally similar compounds. Finally, we examined scenarios in which the utilization of weights resulted in diminished performance, pinpointing spectral features where the application of weights might be detrimental.
Collapse
Affiliation(s)
- Chloe Engler Hart
- Enveda Biosciences, 5700 Flatiron Parkway, Boulder, Colorado 80301, United States
| | - Tobias Kind
- Enveda Biosciences, 5700 Flatiron Parkway, Boulder, Colorado 80301, United States
| | - Pieter C. Dorrestein
- Collaborative
Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and
Pharmaceutical Sciences, University of California
San Diego, La Jolla, California 92093, United States
| | - David Healey
- Enveda Biosciences, 5700 Flatiron Parkway, Boulder, Colorado 80301, United States
| | | |
Collapse
|
8
|
Bittremieux W, Avalon NE, Thomas SP, Kakhkhorov SA, Aksenov AA, Gomes PWP, Aceves CM, Caraballo-Rodríguez AM, Gauglitz JM, Gerwick WH, Huan T, Jarmusch AK, Kaddurah-Daouk RF, Kang KB, Kim HW, Kondić T, Mannochio-Russo H, Meehan MJ, Melnik AV, Nothias LF, O'Donovan C, Panitchpakdi M, Petras D, Schmid R, Schymanski EL, van der Hooft JJJ, Weldon KC, Yang H, Xing S, Zemlin J, Wang M, Dorrestein PC. Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics. Nat Commun 2023; 14:8488. [PMID: 38123557 PMCID: PMC10733301 DOI: 10.1038/s41467-023-44035-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/28/2023] [Indexed: 12/23/2023] Open
Abstract
Despite the increasing availability of tandem mass spectrometry (MS/MS) community spectral libraries for untargeted metabolomics over the past decade, the majority of acquired MS/MS spectra remain uninterpreted. To further aid in interpreting unannotated spectra, we created a nearest neighbor suspect spectral library, consisting of 87,916 annotated MS/MS spectra derived from hundreds of millions of MS/MS spectra originating from published untargeted metabolomics experiments. Entries in this library, or "suspects," were derived from unannotated spectra that could be linked in a molecular network to an annotated spectrum. Annotations were propagated to unknowns based on structural relationships to reference molecules using MS/MS-based spectrum alignment. We demonstrate the broad relevance of the nearest neighbor suspect spectral library through representative examples of propagation-based annotation of acylcarnitines, bacterial and plant natural products, and drug metabolism. Our results also highlight how the library can help to better understand an Alzheimer's brain phenotype. The nearest neighbor suspect spectral library is openly available for download or for data analysis through the GNPS platform to help investigators hypothesize candidate structures for unknown MS/MS spectra in untargeted metabolomics data.
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Computer Science, University of Antwerp, 2020, Antwerpen, Belgium.
| | - Nicole E Avalon
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, 92093, USA
| | - Sydney P Thomas
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Sarvar A Kakhkhorov
- Laboratory of Physical and Chemical Methods of Research, Center for Advanced Technologies, Tashkent, 100174, Uzbekistan
- Department of Food Science, Faculty of Science, University of Copenhagen, Rolighedsvej 26, 1958, Frederiksberg C, Denmark
| | - Alexander A Aksenov
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Chemistry, University of Connecticut, Storrs, CT, 06269, USA
- Arome Science inc., Farmington, CT, 06032, USA
| | - Paulo Wender P Gomes
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Christine M Aceves
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | - Andrés Mauricio Caraballo-Rodríguez
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Julia M Gauglitz
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - William H Gerwick
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
| | - Tao Huan
- Department of Chemistry, University of British Columbia, Vancouver, BC, V6T 1Z1, Canada
| | - Alan K Jarmusch
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Immunity, Inflammation, and Disease Laboratory, Division of Intramural Research, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, Durham, NC, 27709, USA
| | - Rima F Kaddurah-Daouk
- Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, 27701, USA
- Department of Medicine, Duke University, Durham, NC, 27710, USA
- Duke Institute of Brain Sciences, Duke University, Durham, NC, 27710, USA
| | - Kyo Bin Kang
- College of Pharmacy and Research Institute of Pharmaceutical Sciences, Sookmyung Women's University, Seoul, 04310, Korea
| | - Hyun Woo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University, Goyang, 10326, Korea
| | - Todor Kondić
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4367, Belvaux, Luxembourg
| | - Helena Mannochio-Russo
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Biochemistry and Organic Chemistry, Institute of Chemistry, São Paulo State University, Araraquara, 14800-901, Brazil
| | - Michael J Meehan
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Alexey V Melnik
- Department of Chemistry, University of Connecticut, Storrs, CT, 06269, USA
- Arome Science inc., Farmington, CT, 06032, USA
| | - Louis-Felix Nothias
- Université Côte d'Azur, CNRS, ICN, Nice, France
- Interdisciplinary Institute for Artificial Intelligence (3iA) Côte d'Azur, Nice, France
| | - Claire O'Donovan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Morgan Panitchpakdi
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Daniel Petras
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, 72076, Tuebingen, Germany
- Department of Biochemistry, University of California Riverside, Riverside, CA, 92507, USA
| | - Robin Schmid
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Emma L Schymanski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4367, Belvaux, Luxembourg
| | - Justin J J van der Hooft
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, The Netherlands
| | - Kelly C Weldon
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Heejung Yang
- Laboratory of Natural Products Chemistry, College of Pharmacy, Kangwon National University, Chuncheon, 24341, Korea
| | - Shipei Xing
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Chemistry, University of British Columbia, Vancouver, BC, V6T 1Z1, Canada
| | - Jasmine Zemlin
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside, Riverside, CA, 92507, USA
| | - Pieter C Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA.
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
9
|
Latz M, Böhme A, Ulrich N. Reactivity-based identification of oxygen containing functional groups of chemicals applied as potential classifier in non-target analysis. Sci Rep 2023; 13:22828. [PMID: 38129561 PMCID: PMC10739825 DOI: 10.1038/s41598-023-50240-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 12/17/2023] [Indexed: 12/23/2023] Open
Abstract
In this work, we developed a reactivity-based strategy to identify functional groups of unknown analytes, which can be applied as classifier in non-target analysis with gas chromatography. The aim of this strategy is to reduce the number of potential candidate structures generated for a molecular formula determined by high resolution mass spectrometry. We selected an example of 18 isomers with the molecular formula C12H10O2 to test the performance of different derivatization reagents, whereas our aim was to select mild and fast reaction conditions. Based on the results for the isomers, we developed a four-step workflow for the identification of functional groups containing oxygen.
Collapse
Affiliation(s)
- Milena Latz
- Department of Ecological Chemistry, Helmholtz Centre for Environmental Research - UFZ, 04318, Leipzig, Germany
- Faculty of Chemistry and Mineralogy, Leipzig University, 04103, Leipzig, Germany
| | - Alexander Böhme
- Department of Ecological Chemistry, Helmholtz Centre for Environmental Research - UFZ, 04318, Leipzig, Germany
| | - Nadin Ulrich
- Department of Ecological Chemistry, Helmholtz Centre for Environmental Research - UFZ, 04318, Leipzig, Germany.
| |
Collapse
|
10
|
Gong D, Li B, Wu B, Fu D, Li Z, Wei H, Guo S, Ding G, Wang B. The Integration of the Metabolome and Transcriptome for Dendrobium nobile Lindl. in Response to Methyl Jasmonate. Molecules 2023; 28:7892. [PMID: 38067620 PMCID: PMC10707931 DOI: 10.3390/molecules28237892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/23/2023] [Accepted: 11/28/2023] [Indexed: 12/18/2023] Open
Abstract
Dendrobium nobile Lindl., as an endangered medicinal plant within the genus Dendrobium, is widely distributed in southwestern China and has important ecological and economic value. There are a variety of metabolites with pharmacological activity in D. nobile. The alkaloids and polysaccharides contained within D. nobile are very important active components, which mainly have antiviral, anti-tumor, and immunity improvement effects. However, the changes in the compounds and functional genes of D. nobile induced by methyl jasmonate (MeJA) are not clearly understood. In this study, the metabolome and transcriptome of D. nobile were analyzed after exposure to MeJA. A total of 377 differential metabolites were obtained through data analysis, of which 15 were related to polysaccharide pathways and 35 were related to terpenoids and alkaloids pathways. Additionally, the transcriptome sequencing results identified 3256 differentially expressed genes that were discovered in 11 groups. Compared with the control group, 1346 unigenes were differentially expressed in the samples treated with MeJA for 14 days (TF14). Moreover, the expression levels of differentially expressed genes were also significant at different growth and development stages. According to GO and KEGG annotations, 189 and 99 candidate genes were identified as being involved in terpenoid biosynthesis and polysaccharide biosynthesis, respectively. In addition, the co-expression analysis indicated that 238 and 313 transcription factors (TFs) may contribute to the regulation of terpenoid and polysaccharide biosynthesis, respectively. Through a heat map analysis, fourteen terpenoid synthetase genes, twenty-three cytochrome P450 oxidase genes, eight methyltransferase genes, and six aminotransferase genes were identified that may be related to dendrobine biosynthesis. Among them, one sesquiterpene synthase gene was found to be highly expressed after the treatment with MeJA and was positively correlated with the content of dendrobine. This study provides important and valuable metabolomics and transcriptomic information for the further understanding of D. nobile at the metabolic and molecular levels and provides candidate genes and possible intermediate compounds for the dendrobine biosynthesis pathway, which lays a certain foundation for further research on and application of Dendrobium.
Collapse
Affiliation(s)
- Daoyong Gong
- College of Bioengineering, Chongqing University, Chongqing 400045, China;
- Institute of Medicinal Plant Development, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100193, China; (B.W.); (H.W.); (S.G.); (G.D.)
| | - Biao Li
- Institute of Medicinal Plant Development, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100193, China; (B.W.); (H.W.); (S.G.); (G.D.)
| | - Bin Wu
- Institute of Medicinal Plant Development, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100193, China; (B.W.); (H.W.); (S.G.); (G.D.)
| | - Deru Fu
- Steinhardt School of Culture, Education, and Human Development, New York University, New York, NY 10003, USA;
| | - Zesheng Li
- Dehong Tropical Agriculture Research Institute of Yunnan, Ruili 678600, China;
| | - Haobo Wei
- Institute of Medicinal Plant Development, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100193, China; (B.W.); (H.W.); (S.G.); (G.D.)
- School of Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Shunxing Guo
- Institute of Medicinal Plant Development, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100193, China; (B.W.); (H.W.); (S.G.); (G.D.)
| | - Gang Ding
- Institute of Medicinal Plant Development, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing 100193, China; (B.W.); (H.W.); (S.G.); (G.D.)
| | - Bochu Wang
- College of Bioengineering, Chongqing University, Chongqing 400045, China;
| |
Collapse
|
11
|
Li Y, Fiehn O. Flash entropy search to query all mass spectral libraries in real time. Nat Methods 2023; 20:1475-1478. [PMID: 37735567 PMCID: PMC11511675 DOI: 10.1038/s41592-023-02012-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 08/15/2023] [Indexed: 09/23/2023]
Abstract
Public repositories of metabolomics mass spectra encompass more than 1 billion entries. With open search, dot product or entropy similarity, comparisons of a single tandem mass spectrometry spectrum take more than 8 h. Flash entropy search speeds up calculations more than 10,000 times to query 1 billion spectra in less than 2 s, without loss in accuracy. It benefits from using multiple threads and GPU calculations. This algorithm can fully exploit large spectral libraries with little memory overhead for any mass spectrometry laboratory.
Collapse
Affiliation(s)
- Yuanyue Li
- West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA, USA
| | - Oliver Fiehn
- West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA, USA.
| |
Collapse
|
12
|
van Herwerden D, O’Brien JW, Lege S, Pirok BWJ, Thomas KV, Samanipour S. Cumulative Neutral Loss Model for Fragment Deconvolution in Electrospray Ionization High-Resolution Mass Spectrometry Data. Anal Chem 2023; 95:12247-12255. [PMID: 37549176 PMCID: PMC10448439 DOI: 10.1021/acs.analchem.3c00896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 07/03/2023] [Indexed: 08/09/2023]
Abstract
Clean high-resolution mass spectra (HRMS) are essential to a successful structural elucidation of an unknown feature during nontarget analysis (NTA) workflows. This is a crucial step, particularly for the spectra generated during data-independent acquisition or during direct infusion experiments. The most commonly available tools only take advantage of the time domain for spectral cleanup. Here, we present an algorithm that combines the time domain and mass domain information to perform spectral deconvolution. The algorithm employs a probability-based cumulative neutral loss (CNL) model for fragment deconvolution. The optimized model, with a mass tolerance of 0.005 Da and a scoreCNL threshold of 0.00, was able to achieve a true positive rate (TPr) of 95.0%, a false discovery rate (FDr) of 20.6%, and a reduction rate of 35.4%. Additionally, the CNL model was extensively tested on real samples containing predominantly pesticides at different concentration levels and with matrix effects. Overall, the model was able to obtain a TPr above 88.8% with FD rates between 33 and 79% and reduction rates between 9 and 45%. Finally, the CNL model was compared with the retention time difference method and peak shape correlation analysis, showing that a combination of correlation analysis and the CNL model was the most effective for fragment deconvolution, obtaining a TPr of 84.7%, an FDr of 54.4%, and a reduction rate of 51.0%.
Collapse
Affiliation(s)
- Denice van Herwerden
- Van
’t Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Amsterdam 1012 WX, The Netherlands
| | - Jake W. O’Brien
- Van
’t Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Amsterdam 1012 WX, The Netherlands
- Queensland
Alliance for Environmental Health Sciences (QAEHS), The University of Queensland, Brisbane 4102, Australia
| | - Sascha Lege
- Agilent
Technologies Deutschland GmbH, Waldbronn 76337, Germany
| | - Bob W. J. Pirok
- Van
’t Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Amsterdam 1012 WX, The Netherlands
| | - Kevin V. Thomas
- Queensland
Alliance for Environmental Health Sciences (QAEHS), The University of Queensland, Brisbane 4102, Australia
| | - Saer Samanipour
- Van
’t Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Amsterdam 1012 WX, The Netherlands
- Queensland
Alliance for Environmental Health Sciences (QAEHS), The University of Queensland, Brisbane 4102, Australia
- UvA
Data Science Center, University of Amsterdam, Amsterdam 1012 WP, The Netherlands
| |
Collapse
|
13
|
Khateb H, Hook AL, Kern S, Watts JA, Singh S, Jackson D, Marinez-Pomares L, Williams P, Alexander MR. Identification of Pseudomonas aeruginosa exopolysaccharide Psl in biofilms using 3D OrbiSIMS. Biointerphases 2023; 18:031007. [PMID: 37255378 PMCID: PMC10234676 DOI: 10.1116/6.0002604] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 04/29/2023] [Accepted: 05/09/2023] [Indexed: 06/01/2023] Open
Abstract
Secondary ion mass spectrometry (SIMS) offers advantages over both liquid extraction mass spectrometry and matrix assisted laser desorption mass spectrometry in that it provides the direct in situ analysis of molecules and has the potential to preserve the 3D location of an analyte in a sample. Polysaccharides are recognized as challenging analytes in the mass spectrometry of liquids and are also difficult to identify and assign using SIMS. Psl is an exopolysaccharide produced by Pseudomonas aeruginosa, which plays a key role in biofilm formation and maturation. In this Letter, we describe the use of the OrbiTrap analyzer with SIMS (3D OrbiSIMS) for the label-free mass spectrometry of Psl, taking advantage of its high mass resolving power for accurate secondary ion assignment. We study a P. aeruginosa biofilm and compare it with purified Psl to enable the assignment of secondary ions specific to the Psl structure. This resulted in the identification of 17 peaks that could confidently be ascribed to Psl fragments within the biofilm matrix. The complementary approach of the following neutral loss sequences is also shown to identify multiple oligosaccharide fragments without the requirement of a biological reference sample.
Collapse
Affiliation(s)
- Heba Khateb
- Advanced Materials and Healthcare Technologies Division, School of Pharmacy, University of Nottingham, Nottingham, United Kingdom
- National Biofilms Innovation Centre, Biodiscovery Institute and School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Andrew L Hook
- Advanced Materials and Healthcare Technologies Division, School of Pharmacy, University of Nottingham, Nottingham, United Kingdom
| | - Stefanie Kern
- Nanoscale and Microscale Research Centre, University of Nottingham, Nottingham, United Kingdom
| | - Julie A Watts
- Nanoscale and Microscale Research Centre, University of Nottingham, Nottingham, United Kingdom
| | - Sonali Singh
- National Biofilms Innovation Centre and School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Darryl Jackson
- National Biofilms Innovation Centre and School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Luisa Marinez-Pomares
- National Biofilms Innovation Centre and School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Paul Williams
- National Biofilms Innovation Centre, Biodiscovery Institute and School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Morgan R Alexander
- Advanced Materials and Healthcare Technologies Division, School of Pharmacy, University of Nottingham, Nottingham, United Kingdom
- National Biofilms Innovation Centre, Biodiscovery Institute and School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| |
Collapse
|
14
|
de Jonge NF, Mildau K, Meijer D, Louwen JJR, Bueschl C, Huber F, van der Hooft JJJ. Good practices and recommendations for using and benchmarking computational metabolomics metabolite annotation tools. Metabolomics 2022; 18:103. [PMID: 36469190 PMCID: PMC9722809 DOI: 10.1007/s11306-022-01963-y] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 11/18/2022] [Indexed: 12/12/2022]
Abstract
BACKGROUND Untargeted metabolomics approaches based on mass spectrometry obtain comprehensive profiles of complex biological samples. However, on average only 10% of the molecules can be annotated. This low annotation rate hampers biochemical interpretation and effective comparison of metabolomics studies. Furthermore, de novo structural characterization of mass spectral data remains a complicated and time-intensive process. Recently, the field of computational metabolomics has gained traction and novel methods have started to enable large-scale and reliable metabolite annotation. Molecular networking and machine learning-based in-silico annotation tools have been shown to greatly assist metabolite characterization in diverse fields such as clinical metabolomics and natural product discovery. AIM OF REVIEW We highlight recent advances in computational metabolite annotation workflows with a special focus on their evaluation and comparison with other tools. Whilst the progress is substantial and promising, we also argue that inconsistencies in benchmarking different tools hamper users from selecting the most appropriate and promising method for their research. We summarize benchmarking strategies of the different tools and outline several recommendations for benchmarking and comparing novel tools. KEY SCIENTIFIC CONCEPTS OF REVIEW This review focuses on recent advances in mass spectral library-based and machine learning-supported metabolite annotation workflows. We discuss large-scale library matching and analogue search, the current bloom of mass spectral similarity scores, and how molecular networking has changed the field. In addition, the potentials and challenges of machine learning-supported metabolite annotation workflows are highlighted. Overall, recent developments in computational metabolomics have started to fundamentally change metabolomics workflows, and we expect that as a community we will be able to overcome current method performance ambiguities and annotation bottlenecks.
Collapse
Affiliation(s)
- Niek F. de Jonge
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
| | - Kevin Mildau
- Department of Analytical Chemistry, Biochemical Network Analysis Lab, University of Vienna, Vienna, Austria
| | - David Meijer
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
| | - Joris J. R. Louwen
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
| | - Christoph Bueschl
- Department of Analytical Chemistry, Biochemical Network Analysis Lab, University of Vienna, Vienna, Austria
| | - Florian Huber
- Centre for Digitalization and Digitality (ZDD), University of Applied Sciences Düsseldorf, Düsseldorf, Germany
| | - Justin J. J. van der Hooft
- Bioinformatics Group, Wageningen University, Wageningen, the Netherlands
- Department of Biochemistry, University of Johannesburg, Johannesburg, South Africa
| |
Collapse
|
15
|
Cai Y, Zhou Z, Zhu ZJ. Advanced analytical and informatic strategies for metabolite annotation in untargeted metabolomics. Trends Analyt Chem 2022. [DOI: 10.1016/j.trac.2022.116903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
16
|
Bittremieux W, Schmid R, Huber F, van der Hooft JJJ, Wang M, Dorrestein PC. Comparison of Cosine, Modified Cosine, and Neutral Loss Based Spectrum Alignment For Discovery of Structurally Related Molecules. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2022; 33:1733-1744. [PMID: 35960544 DOI: 10.1021/jasms.2c00153] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Spectrum alignment of tandem mass spectrometry (MS/MS) data using the modified cosine similarity and subsequent visualization as molecular networks have been demonstrated to be a useful strategy to discover analogs of molecules from untargeted MS/MS-based metabolomics experiments. Recently, a neutral loss matching approach has been introduced as an alternative to MS/MS-based molecular networking with an implied performance advantage in finding analogs that cannot be discovered using existing MS/MS spectrum alignment strategies. To comprehensively evaluate the scoring properties of neutral loss matching, the cosine similarity, and the modified cosine similarity, similarity measures of 955 228 peptide MS/MS spectrum pairs and 10 million small molecule MS/MS spectrum pairs were compared. This comparative analysis revealed that the modified cosine similarity outperformed neutral loss matching and the cosine similarity in all cases. The data further indicated that the performance of MS/MS spectrum alignment depends on the location and type of the modification, as well as the chemical compound class of fragmented molecules.
Collapse
Affiliation(s)
- Wout Bittremieux
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Robin Schmid
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Florian Huber
- Centre for Digitalization and Digitality, University of Applied Sciences, 40476 Düsseldorf, Germany
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, 6708PB Wageningen, The Netherlands
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa
| | - Mingxun Wang
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|
17
|
Wang F, Allen D, Tian S, Oler E, Gautam V, Greiner R, Metz TO, Wishart DS. CFM-ID 4.0 - a web server for accurate MS-based metabolite identification. Nucleic Acids Res 2022; 50:W165-W174. [PMID: 35610037 PMCID: PMC9252813 DOI: 10.1093/nar/gkac383] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/14/2022] [Accepted: 05/17/2022] [Indexed: 01/31/2023] Open
Abstract
The CFM-ID 4.0 web server (https://cfmid.wishartlab.com) is an online tool for predicting, annotating and interpreting tandem mass (MS/MS) spectra of small molecules. It is specifically designed to assist researchers pursuing studies in metabolomics, exposomics and analytical chemistry. More specifically, CFM-ID 4.0 supports the: 1) prediction of electrospray ionization quadrupole time-of-flight tandem mass spectra (ESI-QTOF-MS/MS) for small molecules over multiple collision energies (10 eV, 20 eV, and 40 eV); 2) annotation of ESI-QTOF-MS/MS spectra given the structure of the compound; and 3) identification of a small molecule that generated a given ESI-QTOF-MS/MS spectrum at one or more collision energies. The CFM-ID 4.0 web server makes use of a substantially improved MS fragmentation algorithm, a much larger database of experimental and in silico predicted MS/MS spectra and improved scoring methods to offer more accurate MS/MS spectral prediction and MS/MS-based compound identification. Compared to earlier versions of CFM-ID, this new version has an MS/MS spectral prediction performance that is ∼22% better and a compound identification accuracy that is ∼35% better on a standard (CASMI 2016) testing dataset. CFM-ID 4.0 also features a neutral loss function that allows users to identify similar or substituent compounds where no match can be found using CFM-ID’s regular MS/MS-to-compound identification utility. Finally, the CFM-ID 4.0 web server now offers a much more refined user interface that is easier to use, supports molecular formula identification (from MS/MS data), provides more interactively viewable data (including proposed fragment ion structures) and displays MS mirror plots for comparing predicted with observed MS/MS spectra. These improvements should make CFM-ID 4.0 much more useful to the community and should make small molecule identification much easier, faster, and more accurate.
Collapse
Affiliation(s)
- Fei Wang
- Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, Canada
| | - Dana Allen
- Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2E9, Canada
| | - Siyang Tian
- Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2E9, Canada
| | - Eponine Oler
- Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2E9, Canada
| | - Vasuk Gautam
- Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2E9, Canada
| | - Russell Greiner
- Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, Canada.,Alberta Machine Intelligence Institute, University of Alberta, Edmonton, AB, T6G 2E8, Canada
| | - Thomas O Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - David S Wishart
- Department of Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, Canada.,Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2E9, Canada.,Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, T6G 2B7, Canada.,Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB, T6G 2H7, Canada.,Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| |
Collapse
|