1
|
Osipenko S, Bashilov A, Vishnevskaya A, Rumiantseva L, Levashova A, Kovalenko A, Tupertsev B, Kireev A, Nikolaev E, Kostyukevich Y. Investigating the Metabolism of Plants Germinated in Heavy Water, D 2O, and H 218O-Enriched Media Using High-Resolution Mass Spectrometry. Int J Mol Sci 2023; 24:15396. [PMID: 37895078 PMCID: PMC10607710 DOI: 10.3390/ijms242015396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 08/23/2023] [Accepted: 08/30/2023] [Indexed: 10/29/2023] Open
Abstract
Mass spectrometry has been an essential technique for the investigation of the metabolic pathways of living organisms since its appearance at the beginning of the 20th century. Due to its capability to resolve isotopically labeled species, it can be applied together with stable isotope tracers to reveal the transformation of particular biologically relevant molecules. However, low-resolution techniques, which were used for decades, had limited capabilities for untargeted metabolomics, especially when a large number of compounds are labelled simultaneously. Such untargeted studies may provide new information about metabolism and can be performed with high-resolution mass spectrometry. Here, we demonstrate the capabilities of high-resolution mass spectrometry to obtain insights on the metabolism of a model plant, Lepidium sativum, germinated in D2O and H218O-enriched media. In particular, we demonstrated that in vivo labeling with heavy water helps to identify if a compound is being synthesized at a particular stage of germination or if it originates from seed content, and tandem mass spectrometry allows us to highlight the substructures with incorporated isotope labels. Additionally, we found in vivo labeling useful to distinguish between isomeric compounds with identical fragmentation patterns due to the differences in their formation rates that can be compared by the extent of heavy atom incorporation.
Collapse
Affiliation(s)
- Sergey Osipenko
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Anton Bashilov
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
- Institute for Translational Medicine and Biotechnology, First Moscow State Medical University, 119991 Moscow, Russia
| | - Anna Vishnevskaya
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Lidiia Rumiantseva
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Anna Levashova
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Anna Kovalenko
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Boris Tupertsev
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Albert Kireev
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Eugene Nikolaev
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| | - Yury Kostyukevich
- Skolkovo Institute of Science and Technology, Bolshoy Boulevard 30, Bld. 1, 121205 Moscow, Russia; (S.O.); (A.B.); (A.V.); (L.R.); (A.L.); (A.K.); (B.T.); (A.K.); (E.N.)
| |
Collapse
|
2
|
Guo R, Zhang Y, Liao Y, Yang Q, Xie T, Fan X, Lin Z, Chen Y, Lu H, Zhang Z. Highly accurate and large-scale collision cross sections prediction with graph neural networks. Commun Chem 2023; 6:139. [PMID: 37402835 DOI: 10.1038/s42004-023-00939-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 06/23/2023] [Indexed: 07/06/2023] Open
Abstract
The collision cross section (CCS) values derived from ion mobility spectrometry can be used to improve the accuracy of compound identification. Here, we have developed the Structure included graph merging with adduct method for CCS prediction (SigmaCCS) based on graph neural networks using 3D conformers as inputs. A model was trained, evaluated, and tested with >5,000 experimental CCS values. It achieved a coefficient of determination of 0.9945 and a median relative error of 1.1751% on the test set. The model-agnostic interpretation method and the visualization of the learned representations were used to investigate the chemical rationality of SigmaCCS. An in-silico database with 282 million CCS values was generated for three different adduct types of 94 million compounds. Its source code is publicly available at https://github.com/zmzhang/SigmaCCS . Altogether, SigmaCCS is an accurate, rational, and off-the-shelf method to directly predict CCS values from molecular structures.
Collapse
Affiliation(s)
- Renfeng Guo
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China
| | - Youjia Zhang
- School of Computer Science and Technology, Huazhong University of Science and Technology, 430074, Wuhan, China
| | - Yuxuan Liao
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China
| | - Qiong Yang
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China
| | - Ting Xie
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China
| | - Xiaqiong Fan
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China
| | - Zhonglong Lin
- Yunnan Academy of Tobacco Agricultural Sciences, 650021, Kunming, Yunnan, China
| | - Yi Chen
- Yunnan Academy of Tobacco Agricultural Sciences, 650021, Kunming, Yunnan, China.
| | - Hongmei Lu
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China.
| | - Zhimin Zhang
- College of Chemistry and Chemical Engineering, Central South University, 410083, Changsha, China.
| |
Collapse
|
3
|
Tupertsev B, Osipenko S, Kireev A, Nikolaev E, Kostyukevich Y. Simple In Vitro 18O Labeling for Improved Mass Spectrometry-Based Drug Metabolites Identification: Deep Drug Metabolism Study. Int J Mol Sci 2023; 24:ijms24054569. [PMID: 36902002 PMCID: PMC10002766 DOI: 10.3390/ijms24054569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 02/21/2023] [Accepted: 02/23/2023] [Indexed: 03/03/2023] Open
Abstract
The identification of drug metabolites formed with different in vitro systems by HPLC-MS is a standard step in preclinical research. In vitro systems allow modeling of real metabolic pathways of a drug candidate. Despite the emergence of various software and databases, identification of compounds is still a complex task. Measurement of the accurate mass, correlation of chromatographic retention times and fragmentation spectra are often insufficient for identification of compounds especially in the absence of reference materials. Metabolites can "slip under the nose", since it is often not possible to reliably confirm that a signal belongs to a metabolite and not to other compounds in complex systems. Isotope labeling has proved to be a tool that aids in small molecule identification. The introduction of heavy isotopes is done with isotope exchange reactions or with complicated synthetic schemes. Here, we present an approach based on the biocatalytic insertion of oxygen-18 isotope under the action of liver microsomes enzymes in the presence of 18O2. Using the local anesthetic bupivacaine as an example, more than 20 previously unknown metabolites were reliably discovered and annotated in the absence of the reference materials. In combination with high-resolution mass spectrometry and modern methods of mass spectrometric metabolism data processing, we demonstrated the ability of the proposed approach to increase the degree of confidence in interpretating metabolism data.
Collapse
Affiliation(s)
- Boris Tupertsev
- Center of Molecular and Cellular Biology (CMCB), Skolkovo Institute of Science and Technology, Nobel Str., 3, 121205 Moscow, Russia
- Moscow Institute of Physics and Technology, Phystech School of Biological and Medical Physics, Institutskiy per., 9, Dolgoprudny, 141701 Moscow, Russia
| | - Sergey Osipenko
- Center for Computational and Data-Intensive Science and Engineering, Skolkovo Institute of Science and Technology, Nobel Str., 3, 121205 Moscow, Russia
| | - Albert Kireev
- Center of Molecular and Cellular Biology (CMCB), Skolkovo Institute of Science and Technology, Nobel Str., 3, 121205 Moscow, Russia
| | - Eugene Nikolaev
- Center of Molecular and Cellular Biology (CMCB), Skolkovo Institute of Science and Technology, Nobel Str., 3, 121205 Moscow, Russia
| | - Yury Kostyukevich
- Center of Molecular and Cellular Biology (CMCB), Skolkovo Institute of Science and Technology, Nobel Str., 3, 121205 Moscow, Russia
- Correspondence:
| |
Collapse
|
4
|
Wang X, Zheng F, Sheng M, Xu G, Lin X. Retention time prediction for small samples based on integrating molecular representations and adaptive network. J Chromatogr B Analyt Technol Biomed Life Sci 2023; 1217:123624. [PMID: 36780745 DOI: 10.1016/j.jchromb.2023.123624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 01/13/2023] [Accepted: 01/27/2023] [Indexed: 02/07/2023]
Abstract
Retention time (RT) can provide orthogonal information different from that of mass spectrometry and contribute to identifying compounds. Many machine learning methods have been developed and applied to RT prediction. In application, the training data size is usually small in most chromatography systems. To enhance the performance of RT prediction, this study proposes a RT prediction method based on multi-data combinations and adaptive neural network (MDC-ANN). MDC-ANN establishes the RT prediction model for the target chromatographic system through transfer learning and a base deep learning model trained on a big dataset. It selects the optimal molecular representation combination from the multiple input candidates and automatically determines the neural network structure according to the determined input combination. MDC-ANN was compared with two new efficient deep learning methods, three transferring methods and four popular machine learning methods on 14 small datasets and showed advantages in MAE, MedAE, MRE and R2 in most cases. The experiment results illustrated that integrating multiple molecular representations can provide more information, improve the performance of RT prediction and contribute to compound annotation, different chromatographic systems may use different molecular representation combinations to obtain good RT prediction performance. Hence, MDC-ANN which automatically determines the best combination of molecular representations for a specific system is promising for predicting RTs accurately in real applications.
Collapse
Affiliation(s)
- Xiaoxiao Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China
| | - Fujian Zheng
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, Liaoning, China.
| | - Meizhen Sheng
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, Liaoning, China
| | - Xiaohui Lin
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China.
| |
Collapse
|
5
|
Retention Time Prediction with Message-Passing Neural Networks. SEPARATIONS 2022. [DOI: 10.3390/separations9100291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023] Open
Abstract
Retention time prediction, facilitated by advances in machine learning, has become a useful tool in untargeted LC-MS applications. State-of-the-art approaches include graph neural networks and 1D-convolutional neural networks that are trained on the METLIN small molecule retention time dataset (SMRT). These approaches demonstrate accurate predictions comparable with the experimental error for the training set. The weak point of retention time prediction approaches is the transfer of predictions to various systems. The accuracy of this step depends both on the method of mapping and on the accuracy of the general model trained on SMRT. Therefore, improvements to both parts of prediction workflows may lead to improved compound annotations. Here, we evaluate capabilities of message-passing neural networks (MPNN) that have demonstrated outstanding performance on many chemical tasks to accurately predict retention times. The model was initially trained on SMRT, providing mean and median absolute cross-validation errors of 32 and 16 s, respectively. The pretrained MPNN was further fine-tuned on five publicly available small reversed-phase retention sets in a transfer learning mode and demonstrated up to 30% improvement of prediction accuracy for these sets compared with the state-of-the-art methods. We demonstrated that filtering isomeric candidates by predicted retention with the thresholds obtained from ROC curves eliminates up to 50% of false identities.
Collapse
|
6
|
Comparative Prediction of Gas Chromatographic Retention Indices for GC/MS Identification of Chemicals Related to Chemical Weapons Convention by Incremental and Machine Learning Methods. SEPARATIONS 2022. [DOI: 10.3390/separations9100265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
During on-site verification activities conducted by the Technical Secretariat of Organization for the Prohibition of Chemical Weapons, identification by gas chromatography retention indices (RI) data, in addition to mass spectrometry data, increase the reliability of factual findings. However, reference RIs do not cover all the possible chemical structures. That is why it is important to have models to predict RIs. Applicable only for narrow data sets of chemicals with a fixed scaffold (G- and V-series gases as example), the non-learning incremental method demonstrated predictive median absolute and percentage errors of 2–4 units and 0.1–0.2%; these are comparable with the experimental bias in RI measurements in the same laboratory with the same GC conditions. It outperforms the accuracy of two reported machine learning methods–median absolute and percentage errors of 11–52 units and 0.5–2.8%. However, for the whole Chemical Weapons Convention (CWC) data set of chemicals, when a fixed scaffold is absent, the incremental method is not applicable; essential machine learning methods achieved accuracy: median absolute and percentage errors of 29–33 units and 0.5–2.2%, depending on the machine learning method. In addition, we have developed a homology tree approach as a convenient method for the visualization of the CWC chemical space. We conclude that non-learning incremental methods may be more accurate than the state-of-the-art machine learning techniques in particular cases, such as predicting the RIs of homologues and isomers of chemicals related to CWC.
Collapse
|
7
|
Tian Z, Liu F, Li D, Fernie AR, Chen W. Strategies for structure elucidation of small molecules based on LC–MS/MS data from complex biological samples. Comput Struct Biotechnol J 2022; 20:5085-5097. [PMID: 36187931 PMCID: PMC9489805 DOI: 10.1016/j.csbj.2022.09.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 09/03/2022] [Accepted: 09/03/2022] [Indexed: 11/06/2022] Open
Abstract
LC–MS/MS is a major analytical platform for metabolomics, which has become a recent hotspot in the research fields of life and environmental sciences. By contrast, structure elucidation of small molecules based on LC–MS/MS data remains a major challenge in the chemical and biological interpretation of untargeted metabolomics datasets. In recent years, several strategies for structure elucidation using LC–MS/MS data from complex biological samples have been proposed, these strategies can be simply categorized into two types, one based on structure annotation of mass spectra and for the other on retention time prediction. These strategies have helped many scientists conduct research in metabolite-related fields and are indispensable for the development of future tools. Here, we summarized the characteristics of the current tools and strategies for structure elucidation of small molecules based on LC–MS/MS data, and further discussed the directions and perspectives to improve the power of the tools or strategies for structure elucidation.
Collapse
|
8
|
Ju R, Liu X, Zheng F, Lu X, Xu G, Lin X. Deep Neural Network Pretrained by Weighted Autoencoders and Transfer Learning for Retention Time Prediction of Small Molecules. Anal Chem 2021; 93:15651-15658. [PMID: 34780148 DOI: 10.1021/acs.analchem.1c03250] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Retention time (RT) prediction contributes to identification of small molecules measured by high-performance liquid chromatography coupled with high-resolution mass spectrometry. Deep learning algorithms based on big data can enhance the accuracy of RT prediction. But at different chromatographic conditions, RTs of compounds are different, and the number of compounds with known RTs is small in most cases. Therefore, the transfer of big data is necessary. In this work, a strategy using a deep neural network (DNN) pretrained by weighed autoencoders and transfer learning (DNNpwa-TL) was proposed to efficiently predict RTs of compounds. The loss function in the autoencoders was calculated with features weighted by mutual information. Then, a DNN pretrained by weighted autoencoders (DNNpwa) was produced. For other specific chromatographic methods, the transfer learning model DNNpwa-TLs were built through fine-tuning the DNNpwa with the help of some compounds with known RTs to conduct the RT prediction. With the above strategy, a DNNpwa was first built with the METLIN small molecule retention time data set containing 80 038 small molecule compounds. A median relative error of 3.1% and a mean relative error of 4.9% were achieved. Then, 17 data sets from different chromatographic methods were studied, and the results showed that the performance of DNNpwa-TL was better than those of other deep learning models. Besides, DNNpwa-TL outperformed random forest, gradient boost, least absolute shrinkage and selection operator regression, and DNN for most of the 17 data sets. Therefore, DNNpwa-TL can provide an efficient method to perform RT prediction of small molecule compounds for different chromatographic methods and conditions.
Collapse
Affiliation(s)
- Ran Ju
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Xinyu Liu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Fujian Zheng
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Xin Lu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| | - Xiaohui Lin
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|