1
|
Zhang XY, Gong XQ. Comprehensive and Explainable Fragmentation: A Machine Learning Approach for Fast and Accurate Mass Spectrum Prediction. J Phys Chem A 2025; 129:3552-3559. [PMID: 40194307 DOI: 10.1021/acs.jpca.4c08663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
Mass spectrometry (MS) is a fundamental tool for chemical identification. The current in-silico prediction tools can handle broad instrument conditions, large molecular libraries or fragment structures only on a very limited level. In this work, we propose a dual-model machine learning strategy that can solve this problem by jointly a classification model for fragment identification and noise filtering, and a regression model for spectral prediction. With the help of attention mechanism, our method outperforms other algorithms in accuracy and efficiency, providing a deeper understanding of the molecular fragmentation behavior in mass spectra. Our method can facilitate the large-scale in-silico spectra calculations and the analysis of unknown molecular structures, which may promote wider applications for MS.
Collapse
Affiliation(s)
- Xian-Yang Zhang
- Centre for Computational Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Xue-Qing Gong
- Centre for Computational Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
- School of Chemistry and Chemical Engineering, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
| |
Collapse
|
2
|
Gorges J, Grimme S. QCxMS2 - a program for the calculation of electron ionization mass spectra via automated reaction network discovery. Phys Chem Chem Phys 2025; 27:6899-6911. [PMID: 40052418 DOI: 10.1039/d5cp00316d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2025]
Abstract
We present a new fully-automated computational workflow for the calculation of electron ionization mass spectra by automated reaction network discovery, transition state theory and Monte-Carlo simulations. Compared to its predecessor QCxMS [S. Grimme, Angew. Chem., Int. Ed., 52, 6306-6312] based on extensive molecular dynamics (MD) simulations, QCxMS2's more efficient approach of using stationary points on the potential energy surface (PES) enables the usage of accurate quantum chemical methods. Fragment geometries and reaction paths are optimized with fast semi-empirical quantum mechanical (SQM) methods and reaction barriers are refined at the density functional theory (DFT) level. This composite approach using GFN2-xTB geometries in combination with energies at the ωB97X-3c level proved to be an efficient combination. On a small but diverse test set of 16 organic and inorganic molecules, QCxMS2 spectra are more accurate than ones from QCxMS yielding on average a higher mass spectral matching of 0.700 compared to QCxMS with 0.622, and is more robust with a minimal matching of 0.498 versus 0.100. Further improvements were observed when both geometries and energies were computed at the ωB97X-3c level, yielding an average matching score of 0.730 and a minimal score of 0.527. Due to its higher accuracy and robustness while maintaining computational efficiency, we propose QCxMS2 as a complementary, more reliable and systematically improvable successor to QCxMS for elucidating fragmentation pathways and predicting electron ionization mass spectra of unknown chemical substances, e.g., in analytical chemistry applications. If coupled to currently developed improved SQM methods, QCxMS2 opens an efficient route to accurate, and routine mass spectra predictions. The QCxMS2 program suite is freely available on GitHub.
Collapse
Affiliation(s)
- Johannes Gorges
- Mulliken Center for Theoretical Chemistry, Clausius-Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany.
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Clausius-Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany.
| |
Collapse
|
3
|
Katbashev A, Stahn M, Rose T, Alizadeh V, Friede M, Plett C, Steinbach P, Ehlert S. Overview on Building Blocks and Applications of Efficient and Robust Extended Tight Binding. J Phys Chem A 2025; 129:2667-2682. [PMID: 40013428 DOI: 10.1021/acs.jpca.4c08263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2025]
Abstract
The extended tight binding (xTB) family of methods opened many new possibilities in the field of computational chemistry. Within just 5 years, the GFN2-xTB parametrization for all elements up to Z = 86 enabled more than a thousand applications, which were previously not feasible with other electronic structure methods. The xTB methods provide a robust and efficient way to apply quantum mechanics-based approaches for obtaining molecular geometries, computing free energy corrections or describing noncovalent interactions and found applicability for many more targets. A crucial contribution to the success of the xTB methods is the availability within many simulation packages and frameworks, supported by the open source development of its program library and packages. We present a comprehensive summary of the applications and capabilities of xTB methods in different fields of chemistry. Moreover, we consider the main software packages for xTB calculations, covering their current ecosystem, novel features, and usage by the scientific community.
Collapse
Affiliation(s)
- Abylay Katbashev
- Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Marcel Stahn
- Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
- OpenEye, Cadence Molecular Sciences, Ebertplatz 1, 50668 Cologne, Germany
| | - Thomas Rose
- Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Vahideh Alizadeh
- Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
- Center for Advanced Systems Understanding (CASUS), Untermarkt 20, 02826 Görlitz, Germany
| | - Marvin Friede
- Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Christoph Plett
- Mulliken Center for Theoretical Chemistry, Clausius Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Pit Steinbach
- Institute of Physical Chemistry, RWTH Aachen University, Melatener Str. 20, 52074 Aachen, Germany
| | - Sebastian Ehlert
- AI for Science, Microsoft Research, Evert van de Beekstraat 354, 1118 CZ Schiphol, The Netherlands
| |
Collapse
|
4
|
Lee J, Tantillo DJ, Wang LP, Fiehn O. Predicting Collision-Induced-Dissociation Tandem Mass Spectra (CID-MS/MS) Using Ab Initio Molecular Dynamics. J Chem Inf Model 2024; 64:7470-7487. [PMID: 39329407 PMCID: PMC11492810 DOI: 10.1021/acs.jcim.4c00760] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/28/2024]
Abstract
Compound identification is at the center of metabolomics, usually by comparing experimental mass spectra against library spectra. However, most compounds are not commercially available to generate library spectra. Hence, for such compounds, MS/MS spectra need to be predicted. Machine learning and heuristic models have largely failed except for lipids. Here, quantum chemistry software can be used to predict mass spectra. However, quantum chemistry predictions for collision induced dissociation (CID) mass spectra in LC-MS/MS are rare. We present the CIDMD (Collision-Induced Dissociation via Molecular Dynamics) framework to model CID-based MS/MS spectra. It uses first-principles molecular dynamics (MD) to simulate the physical process of molecular collisions in CID tandem mass spectrometry. First, molecular ions are constructed at specific protonation sites. Using density functional theory, these protonated ions are targeted by argon collider gas atoms at user-specified velocities. Subsequent bond breakages are simulated over time for at least 1,000 fs. Each simulation is repeated multiple times from various collisional directions. Fragmentations are accumulated over those repeated collisions to generate CIDMD in silico mass spectra. Twelve small metabolites (<205 Da) were selected to test the accuracy of this framework in comparison to experimental MS/MS spectra. When testing different protomers, collider velocities, number of simulations, simulation time and impact factor b cutoffs, we yielded 261 predicted mass spectra. These in silico spectra resulted in entropy similarity scores of an average 624 ± 189 for all 261 spectra compared to their corresponding experimental spectra, which improved to 828 ± 77 when using optimal parameters of the most probable protomers for 12 molecules. With increasing molecular mass, higher velocities achieved better results. Similarly, different protomers showed large differences in fragmentation; hence, with increasing numbers of protomers and tautomers, the average CIDMD prediction accuracy decreased. Mechanistic details showed that specific fragment ions can be produced from different protomers via multiple fragmentation pathways. We propose that CIDMD is a suitable tool to predict mass spectra of small metabolites like produced by the gut microbiome.
Collapse
Affiliation(s)
- Jesi Lee
- Department of Chemistry, University of California, Davis, California 95616, United States
- West Coast Metabolomics Center, University of California, Davis, California 95616, United States
| | - Dean Joseph Tantillo
- Department of Chemistry, University of California, Davis, California 95616, United States
| | - Lee-Ping Wang
- Department of Chemistry, University of California, Davis, California 95616, United States
| | - Oliver Fiehn
- West Coast Metabolomics Center, University of California, Davis, California 95616, United States
| |
Collapse
|
5
|
Wang S, Lin C, Zhao L, Gong X, Zhang M, Zhang H, Hu P. Identifying isomers in Chinese traditional medicine via density functional theory and ion fragmentation simulation software QCxMS. J Chromatogr A 2024; 1730:465122. [PMID: 38941796 DOI: 10.1016/j.chroma.2024.465122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Revised: 06/22/2024] [Accepted: 06/24/2024] [Indexed: 06/30/2024]
Abstract
In the realm of electrospray ionization mass spectrometry (ESI-MS), distinguishing among isomers poses a significant challenge due to the minimal spectral differences that often arise from their subtle structural differences. This makes the accurate identification of these compounds through solely experimental spectra a daunting task. Computational chemistry has emerged as a pivotal tool in bridging the gap between experimental observations and theoretical understanding. This study used the MS fragmentation simulation software, QCxMS, to model the spectra of five groups of isomers, encompassing 11 compounds, found in the traditional Chinese medicine, Zhishi Xiebai Guizhi Decoction. By comparing the spectra predicted through computational methods with those derived from Ultra-high performance liquid chromatography-quadrupole-time of flight-mass spectrometry (UPLC-Q-TOF-MS) experiments, it was observed that, following the optimization of simulation parameters, QCxMS was capable of generating reliable spectra for all examined compounds. Notably, the data calculated under both GFN1-xTB and GFN2-xTB levels exhibited no significant discrepancies. Further analysis enabled the identification of the principal fragments of the 11 compounds from the theoretical data, facilitating the deduction of their fragmentation pathways. The Density Functional Theory (DFT) method was subsequently applied to compute the primary fragmentation energies of these compounds. The findings revealed a congruence between the energy data calculated using both thermodynamic and kinetic approaches and the observed fragment abundance of the isomers. This alignment providing a more precise theoretical framework for understanding the mechanisms underlying the generation of fragment ion differences among isomers.
Collapse
Affiliation(s)
- Shuai Wang
- Shanghai Key Laboratory of Functional Materials Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Chuhui Lin
- Shanghai Key Laboratory of Functional Materials Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Linghao Zhao
- Shanghai Key Laboratory of Functional Materials Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Xueqing Gong
- School of Chemistry and Chemical Engineering, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China
| | - Min Zhang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Hongyang Zhang
- Shanghai Key Laboratory of Functional Materials Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Ping Hu
- Shanghai Key Laboratory of Functional Materials Chemistry, School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, China.
| |
Collapse
|
6
|
Kirschbaum C, Greis K, Torres-Boy A, Riedel J, Gewinner S, Schöllkopf W, Meijer G, Helden GV, Pagel K. Studying the Intrinsic Reactivity of Chromanes by Gas-Phase Infrared Spectroscopy. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1950-1958. [PMID: 38950388 PMCID: PMC11311547 DOI: 10.1021/jasms.4c00216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 06/11/2024] [Accepted: 06/19/2024] [Indexed: 07/03/2024]
Abstract
Tandem mass spectrometry is routinely used for the structural analysis of organic molecules, but many fragmentation reactions are not well understood. Because several potential structures can correspond to a measured mass, the assignment of product ions is ambiguous using mass spectrometry alone. Here, we combine mass spectrometry with high-resolution gas-phase infrared spectroscopy and computational chemistry tools to identify product ion structures and derive collision-induced fragmentation mechanisms of the chromane derivatives Trolox and Methyltrolox. We find that protonated Trolox and Methyltrolox fragment identically via dehydration and decarbonylation, while deprotonated ions display substantially diverging reactivities. For deprotonated Methyltrolox, we observe unusual radical fragmentation reactions and suggest a [1,2]-Wittig rearrangement involving aryl migration in the gas phase. Overall, the combined experimental and theoretical approach presented here revealed complex proton dynamics and intramolecular rearrangement reactions, which expand our understanding on structure-reactivity relationships of isolated molecules in different protonation states.
Collapse
Affiliation(s)
- Carla Kirschbaum
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Kim Greis
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | | | - Jerome Riedel
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Sandy Gewinner
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | | | - Gerard Meijer
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Gert von Helden
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| | - Kevin Pagel
- Freie
Universität Berlin, Institute of Chemistry
and Biochemistry, 14195 Berlin, Germany
- Fritz
Haber Institute of the Max Planck Society, 14195 Berlin, Germany
| |
Collapse
|
7
|
Zhang B, Hao M, Xiong J, Li X, Koopman J. Ab initio molecular dynamics calculations on electron ionization induced fragmentations of C 4F 7N and C 5F 10O for understanding their decompositions under discharge conditions. Phys Chem Chem Phys 2023; 25:7540-7549. [PMID: 36857631 DOI: 10.1039/d2cp03498k] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
Abstract
C4F7N and C5F10O are the most promising SF6 alternatives as eco-friendly insulating gaseous mediums in electrical engineering. It is necessary to clarify their electrical stability and decomposition mechanisms. In this work, we first introduced our experimental results for decomposition products of C4F7N/CO2 and C5F10O/synthetic air mixtures under partial discharge and spark discharge conditions. Then, we performed ab initio molecular dynamics (AIMD) simulations on the typical decomposition products. The simulations were performed under standard electron impact mass spectrometry (EI-MS); thus, the statistical results of the mass spectra were compared with those of the experimentally obtained standard mass spectra from the NIST database. The AIMD simulation method in simulating the electron-induced ionization process was verified and found to be reliable. Finally, the calculations were also performed for C4F7N and C5F10O with incident electron energies of 20 eV and 70 eV, respectively. The dominant pathway for both gases is the formation of CF3+ with the fracture of the C-C bond. The AIMD simulation is able to predict the decomposition channels after electron-impact ionization without any preconceived knowledge of fragmentation pathways, which provides a novel insight into understanding the decomposition mechanisms of C4F7N and C5F10O under different discharge conditions with different energies.
Collapse
Affiliation(s)
- Boya Zhang
- State Key Laboratory of Electrical Insulation and Power Equipment, Xi'an Jiaotong University, Xi'an, Shaanxi Province 710049, China.
| | - Mai Hao
- State Key Laboratory of Electrical Insulation and Power Equipment, Xi'an Jiaotong University, Xi'an, Shaanxi Province 710049, China.
| | - Jiayu Xiong
- State Key Laboratory of Electrical Insulation and Power Equipment, Xi'an Jiaotong University, Xi'an, Shaanxi Province 710049, China. .,State Grid Sichuan Electric Power Research Institute, Chengdu, Sichuan Province 610041, China
| | - Xingwen Li
- State Key Laboratory of Electrical Insulation and Power Equipment, Xi'an Jiaotong University, Xi'an, Shaanxi Province 710049, China.
| | - Jeroen Koopman
- Mulliken Center for Theoretical Chemistry, Institute of Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| |
Collapse
|
8
|
Fu D, Habtegabir SG, Wang H, Feng S, Han Y. Understanding of protomers/deprotomers by combining mass spectrometry and computation. Anal Bioanal Chem 2023:10.1007/s00216-023-04574-1. [PMID: 36737499 DOI: 10.1007/s00216-023-04574-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 01/19/2023] [Accepted: 01/27/2023] [Indexed: 02/05/2023]
Abstract
Multifunctional compounds may form different prototropic isomers under different conditions, which are known as protomers/deprotomers. In biological systems, these protomer/deprotomer isomers affect the interaction modes and conformational landscape between compounds and enzymes and thus present different biological activities. Study on protomers/deprotomers is essentially the study on the acidity/basicity of each intramolecular functional group and its effect on molecular structure. In recent years, the combination of mass spectrometry (MS) and computational chemistry has been proven to be a powerful and effective means to study prototropic isomers. MS-based technologies are developed to discriminate and characterize protomers/deprotomers to provide structural information and monitor transformations, showing great superiority than other experimental methods. Computational chemistry is used to predict the thermodynamic stability of protomers/deprotomers, provide the simulated MS/MS spectra, infrared spectra, and calculate collision cross-section values. By comparing the theoretical data with the corresponding experimental results, the researchers can not only determine the protomer/deprotomer structure, but also investigate the structure-activity relationship in a given system. This review covers various MS methods and theoretical calculations and their devotion to isomer discrimination, structure identification, conformational transformation, and phase transition investigation of protomers/deprotomers.
Collapse
Affiliation(s)
- Dali Fu
- State Key Laboratory of Heavy Oil Processing, College of Chemical Engineering and Environment, China University of Petroleum-Beijing, Beijing, 102249, People's Republic of China
| | - Sara Girmay Habtegabir
- State Key Laboratory of Heavy Oil Processing, College of Chemical Engineering and Environment, China University of Petroleum-Beijing, Beijing, 102249, People's Republic of China
| | - Haodong Wang
- State Key Laboratory of Heavy Oil Processing, College of Chemical Engineering and Environment, China University of Petroleum-Beijing, Beijing, 102249, People's Republic of China
| | - Shijie Feng
- State Key Laboratory of Heavy Oil Processing, College of Chemical Engineering and Environment, China University of Petroleum-Beijing, Beijing, 102249, People's Republic of China
| | - Yehua Han
- State Key Laboratory of Heavy Oil Processing, College of Chemical Engineering and Environment, China University of Petroleum-Beijing, Beijing, 102249, People's Republic of China.
| |
Collapse
|